OPPO-Mente-Lab/X2I

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OPPO-Mente-Lab/X2I)

OPPO-Mente-Lab / X2I

Official code for ICCV 2025 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation

☆89

Alternatives and similar repositories for X2I

Users that are interested in X2I are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

InternLM / Spark
View on GitHub
An official implementation of "SPARK: Synergistic Policy And Reward Co-Evolving Framework"
☆25Oct 23, 2025Updated 9 months ago
hzphzp / WeGen
View on GitHub
☆27Apr 25, 2025Updated last year
OPPO-Mente-Lab / X2Edit
View on GitHub
AAAI2026 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning
☆97Nov 21, 2025Updated 8 months ago
OPPO-Mente-Lab / PEA-Diffusion
View on GitHub
PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation
☆37Oct 28, 2024Updated last year
OPPO-Mente-Lab / TLCM
View on GitHub
Official repo for 【TLCM: Training-efficient Latent Consistency Model for Image Generation with 2-8 Steps】
☆36Dec 27, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
beichenzbc / BoostStep
View on GitHub
official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"
☆37Jan 21, 2025Updated last year
chenllliang / DreamEngine
View on GitHub
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!
☆123Mar 4, 2025Updated last year
Shakker-Labs / RepText
View on GitHub
RepText: Rendering Visual Text via Replicating 🔥
☆139Jun 7, 2025Updated last year
fenghora / personalize-anything
View on GitHub
[AAAI 2026] Personalize Anything for Free with Diffusion Transformer
☆361Mar 26, 2026Updated 3 months ago
DCDmllm / AnyEdit
View on GitHub
【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"
☆226Apr 5, 2025Updated last year
Bujiazi / DiCache
View on GitHub
[ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache
☆61Jan 26, 2026Updated 5 months ago
OPPO-Mente-Lab / GlyphDraw2
View on GitHub
GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models
☆87Jul 11, 2024Updated 2 years ago
bytedance / UNO
View on GitHub
[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning
☆1,360Sep 12, 2025Updated 10 months ago
erwold / qwen2vl-flux
View on GitHub
☆571Nov 26, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
OPPO-Mente-Lab / GlyphDraw
View on GitHub
Text-To-Image Generation with Chinese Characters
☆133Jul 20, 2023Updated 3 years ago
rongyaofang / GoT
View on GitHub
Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"
☆317Sep 28, 2025Updated 9 months ago
ymju-BAAI / CI-VID
View on GitHub
☆30Sep 4, 2025Updated 10 months ago
GongyeLiu / Awesome-Alignment-of-Diffusion-Models
View on GitHub
paper collection: alignment of diffusion models
☆29Mar 6, 2026Updated 4 months ago
MS-Diffusion / MS-Diffusion
View on GitHub
[ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
☆311Jul 30, 2025Updated 11 months ago
Xuan-World / UniCombine
View on GitHub
UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer
☆129Jun 27, 2025Updated last year
ip-composer / IP-Composer
View on GitHub
☆20Apr 15, 2025Updated last year
gemlab-vt / motionshop
View on GitHub
MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance
☆26Dec 12, 2024Updated last year
FireRedTeam / Single-Trajectory-Distillation
View on GitHub
☆27Feb 11, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Eureka-Maggie / MIGE
View on GitHub
Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing
☆72Jul 13, 2025Updated last year
CaraJ7 / T2I-R1
View on GitHub
[NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT
☆433Sep 18, 2025Updated 10 months ago
hqhQAQ / PatchDPO
View on GitHub
[CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation
☆47Jul 1, 2025Updated last year
arthur-71 / Grounded-Instruct-Pix2Pix
View on GitHub
☆14Nov 24, 2023Updated 2 years ago
finegrain-ai / comfyui-finegrain
View on GitHub
ComfyUI custom nodes to interact with the Finegrain API
☆13Sep 17, 2025Updated 10 months ago
bcmi / Granular-GRPO
View on GitHub
[CVPR 2026] Fine-Grained GRPO for Precise Preference Alignment in Flow Models
☆64Jun 1, 2026Updated last month
rubi-du / ComfyUI-MaskEditor-Extension
View on GitHub
This repository extends the mask editor in Comfyui and supports lasso method for applying masks
☆14Jul 23, 2025Updated last year
singlinhaha / Comfyui_Heygem_Docker
View on GitHub
☆14Mar 27, 2025Updated last year
PKU-YuanGroup / ImgEdit
View on GitHub
[NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark
☆327Nov 5, 2025Updated 8 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
OPPO-Mente-Lab / FaceScore
View on GitHub
Official repo for 【FaceScore: Benchmarking and Enhancing Face Quality in Human Generation】
☆84Dec 26, 2024Updated last year
bytedance / UMO
View on GitHub
[CVPR 2026] 🔥🔥 Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward
☆190Sep 15, 2025Updated 10 months ago
Yuanshi9815 / OminiControl
View on GitHub
[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer
☆1,926Jul 2, 2026Updated 3 weeks ago
Cooperx521 / ScaleCap
View on GitHub
(ICLR 2026)Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’
☆60Jan 26, 2026Updated 5 months ago
ML-GSAI / Concat-ID
View on GitHub
Concat-ID: Towards Universal Identity-Preserving Video Synthesis
☆65May 7, 2025Updated last year
OPPO-Mente-Lab / Qwen-Image-Pruning
View on GitHub
CVPR 2026 Highlight: Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers
☆86Apr 9, 2026Updated 3 months ago
xie-lab-ml / Zigzag-Diffusion-Sampling
View on GitHub
[ICLR2025] The code of Z-Sampling, proposed in our paper "Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflectio…
☆103May 20, 2026Updated 2 months ago