chenllliang/DreamEngine

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chenllliang/DreamEngine)

chenllliang / DreamEngine

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!

☆123

Alternatives and similar repositories for DreamEngine

Users that are interested in DreamEngine are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

chenllliang / MMEvalPro
View on GitHub
[NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs
☆25Sep 26, 2024Updated last year
EchoPluto / MagicID
View on GitHub
☆35Mar 18, 2025Updated last year
snap-research / VIMI
View on GitHub
☆13Jul 10, 2024Updated 2 years ago
MiZhenxing / ThinkDiff
View on GitHub
ICML2025, I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models
☆191Sep 7, 2025Updated 10 months ago
modelscope / Nexus-Gen
View on GitHub
☆292Jul 29, 2025Updated 11 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
illume-unified-mllm / ILLUME_plus
View on GitHub
[CVPR2025] Official Implementation of ILLUME+
☆126Aug 20, 2025Updated 11 months ago
TencentARC / FluxKits
View on GitHub
☆109Nov 27, 2024Updated last year
ControlGenAI / T-LoRA
View on GitHub
[AAAI 2026] This is the official implementation of "T-LoRA: Single Image Diffusion Model Customization Without Overfitting"
☆149Apr 24, 2026Updated 2 months ago
fenghora / personalize-anything
View on GitHub
[AAAI 2026] Personalize Anything for Free with Diffusion Transformer
☆361Mar 26, 2026Updated 3 months ago
causalfusion / causalfusion
View on GitHub
☆196Dec 17, 2024Updated last year
erwold / qwen2vl-flux
View on GitHub
☆571Nov 26, 2024Updated last year
Shakker-Labs / RepText
View on GitHub
RepText: Rendering Visual Text via Replicating 🔥
☆139Jun 7, 2025Updated last year
OPPO-Mente-Lab / X2I
View on GitHub
Official code for ICCV 2025 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distill…
☆89Jun 26, 2025Updated last year
rongyaofang / GoT
View on GitHub
Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"
☆317Sep 28, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
SilentView / GigaTok
View on GitHub
[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"
☆204Jan 7, 2026Updated 6 months ago
qihao067 / CrossFlow
View on GitHub
[CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…
☆343Jun 8, 2025Updated last year
camenduru / Matting-Anything-colab
View on GitHub
☆10Jul 25, 2023Updated 2 years ago
stepfun-ai / Step1X-Edit
View on GitHub
A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem…
☆2,236Apr 29, 2026Updated 2 months ago
CUC-MIPG / Edit-Transfer
View on GitHub
Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations"
☆89Jun 6, 2025Updated last year
Xilluill / KV-Edit
View on GitHub
[ICCV 2025] Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation
☆386May 21, 2025Updated last year
showlab / PhotoDoodle
View on GitHub
[ICCV 2025] Code Implementation of "ArtEditor: Learning Customized Instructional Image Editor from Few-Shot Examples"
☆430Apr 23, 2025Updated last year
wtybest / FreeFlux
View on GitHub
[ICCV 2025] FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing
☆77Mar 7, 2026Updated 4 months ago
csuhan / Tar
View on GitHub
[NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
☆202Sep 18, 2025Updated 10 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Alpha-VLLM / Lumina-Image-2.0
View on GitHub
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework
☆1,006May 22, 2026Updated last month
KaiyueSun98 / T2I-Personalization-with-AR
View on GitHub
☆47Apr 20, 2025Updated last year
alexanderswerdlow / unidisc
View on GitHub
UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…
☆142Apr 2, 2025Updated last year
FranxYao / Retrieval-Head-with-Flash-Attention
View on GitHub
Efficient retrieval head analysis with triton flash attention that supports topK probability
☆13Jun 15, 2024Updated 2 years ago
wdrink / SimpleAR
View on GitHub
Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"
☆431Jun 20, 2025Updated last year
ali-vilab / IDEA-Bench
View on GitHub
Official repository of IDEA-Bench
☆41Jan 24, 2025Updated last year
SHI-Labs / T2I-Copilot
View on GitHub
T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25)
☆56Oct 6, 2025Updated 9 months ago
DAMO-NLP-SG / DiGIT
View on GitHub
[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
☆78Oct 31, 2024Updated last year
jylei16 / Imagine-e
View on GitHub
☆14Jan 22, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
PKU-YuanGroup / UniWorld
View on GitHub
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
☆883Dec 23, 2025Updated 6 months ago
wyhlovecpp / GPT-Image-Edit
View on GitHub
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
☆243Aug 15, 2025Updated 11 months ago
MizzenAI / HPSv3
View on GitHub
Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)
☆325Dec 5, 2025Updated 7 months ago
junhahyung / MagiCapture
View on GitHub
☆11Feb 26, 2024Updated 2 years ago
zhaoshitian / LeX-Art
View on GitHub
Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"
☆85Aug 25, 2025Updated 10 months ago
zichongc / ComfyUI-Attention-Distillation
View on GitHub
Official Implementation of Attention Distillation for ComfyUI
☆110Mar 18, 2025Updated last year
daeunni / Video-Skill-CoT
View on GitHub
Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Findings]"
☆18Aug 27, 2025Updated 10 months ago