OPPO-Mente-Lab / X2ILinks
Official code for ICCV 205 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
☆78Updated last month
Alternatives and similar repositories for X2I
Users that are interested in X2I are comparing it to the libraries listed below
Sorting:
- Consistency Distillation with Target Timestep Selection and Decoupled Guidance☆83Updated 7 months ago
- [ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen…☆250Updated 2 months ago
- Conceptrol: Concept Control of Zero-shot Personalized Image Generation☆43Updated 4 months ago
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations"☆82Updated last month
- [Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing☆13Updated 4 months ago
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆208Updated 3 months ago
- RepText: Rendering Visual Text via Replicating 🔥☆132Updated last month
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance☆289Updated this week
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching☆160Updated 8 months ago
- Official Implementation: Training-Free Efficient Video Generation via Dynamic Token Carving☆225Updated last month
- ImgEdit: A Unified Image Editing Dataset and Benchmark☆155Updated last week
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆56Updated 2 months ago
- Subjects200K dataset☆114Updated 6 months ago
- ☆50Updated 7 months ago
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆62Updated 2 weeks ago
- ☆112Updated last year
- ☆50Updated 7 months ago
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization☆240Updated 3 months ago
- ☆116Updated last month
- 【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"☆174Updated 4 months ago
- The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".☆158Updated 7 months ago
- [CVPR 2025] Official Implementation of MotionPro: A Precise Motion Controller for Image-to-Video Generation☆119Updated 3 weeks ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆155Updated last month
- An Efficient Text-to-Image Generation Pretrain Pipeline☆111Updated 3 months ago
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control☆188Updated 7 months ago
- Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance☆68Updated 2 months ago
- PosterMaker [CVPR 2025] https://poster-maker.github.io/☆110Updated 3 months ago
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)☆124Updated last year
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation☆38Updated last month
- [ICCV 2025] MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers☆119Updated last month