SHI-Labs/T2I-Copilot


T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25)

☆57

Alternatives and similar repositories for T2I-Copilot

Users who are interested in T2I-Copilot are comparing it to the repositories listed below.

zhenyuw16 / GenArtist
Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing"
☆168 · Oct 23, 2024 · Updated last year
snap-research / VIMI
☆13 · Jul 10, 2024 · Updated 2 years ago
wtybest / EnMMDiT
[TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation
☆15 · Mar 7, 2026 · Updated 4 months ago
jacklishufan / Reflect-DiT
Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection
☆56 · Aug 16, 2025 · Updated 11 months ago
google-deepmind / proactive_t2i_agents
Code release for the paper "Proactive Agents for Text-to-Image Generation under Uncertainty"
☆76 · Jul 28, 2025 · Updated 11 months ago
shengjun-zhang / VisualGRPO
E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models
☆44 · Jan 5, 2026 · Updated 6 months ago
chenllliang / DreamEngine
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!
☆123 · Mar 4, 2025 · Updated last year
appletea233 / EditThinker
Unlocking Iterative Reasoning for Any Image Editor
☆112 · Jan 18, 2026 · Updated 6 months ago
ziqipang / ADDP
[ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
☆15 · Jul 4, 2025 · Updated last year
PeterYYZhang / LayerCraft
Official Repo for LayerCraft
☆18 · May 3, 2026 · Updated 2 months ago
EthanG97 / ImageDoctor
The official implementation for "ImageDoctor: Diagnosing Text-to-Image Generation via Grounded Image Reasoning"
☆15 · Mar 1, 2026 · Updated 4 months ago
Vchitect / Evaluation-Agent
[ACL2025 Oral & Award] Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexible
☆128 · Aug 10, 2025 · Updated 11 months ago
bcmi / Granular-GRPO
[CVPR 2026] Fine-Grained GRPO for Precise Preference Alignment in Flow Models
☆64 · Jun 1, 2026 · Updated last month
PKU-YuanGroup / ImgEdit
[NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark
☆328 · Nov 5, 2025 · Updated 8 months ago
VILA-Lab / i-mae
i-mae PyTorch Repo
☆20 · Apr 6, 2024 · Updated 2 years ago
paulgavrikov / biases_vs_generalization
Official code for the CVPR 2024 Paper "Can Biases in ImageNet Models Explain Generalization?".
☆13 · Jun 24, 2024 · Updated 2 years ago
daeunni / Video-Skill-CoT
Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Findings]"
☆18 · Aug 27, 2025 · Updated 10 months ago
OneIG-Bench / OneIG-Benchmark
[NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models acro…
☆120 · Feb 10, 2026 · Updated 5 months ago
Gen-Verse / Paper2Video
[ICCV 2025] Preacher: Paper-to-Video Agentic System
☆50 · Sep 1, 2025 · Updated 10 months ago
umm-emma / emma
Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."
☆62 · Dec 16, 2025 · Updated 7 months ago
see-say-segment / sesame
🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"
☆47 · Jun 16, 2024 · Updated 2 years ago
FYYDCC / IVT-LR
Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”
☆18 · Jan 27, 2026 · Updated 5 months ago
ZiyuGuo99 / Thinking-while-Generating
The first interleaved framework for textual reasoning within the visual generation process
☆164 · Mar 16, 2026 · Updated 4 months ago
rongyaofang / GoT
Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"
☆317 · Sep 28, 2025 · Updated 9 months ago
GAIR-NLP / thinking-with-generated-images
Doodling our way to AGI ✏️ 🖼️ 🧠
☆128 · May 29, 2025 · Updated last year
Franklin-Zhang0 / ReasonGen-R1
Official repository for ReasonGen-R1
☆75 · Jun 23, 2025 · Updated last year
akira-l / SEEG
Code for SEEG: Semantic Energized Co-speech Gesture Generation
☆33 · Dec 3, 2022 · Updated 3 years ago
path2generalist / General-Level
On Path to Multimodal Generalist: General-Level and General-Bench
☆21 · Jul 11, 2025 · Updated last year
mapo-t2i / mapo
Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).
☆83 · Jun 11, 2024 · Updated 2 years ago
jylei16 / Imagine-e
☆14 · Jan 22, 2025 · Updated last year
lambert-x / VideoAuteur
VideoAuteur: Towards Long Narrative Video Generation
☆44 · Oct 22, 2025 · Updated 9 months ago
Huage001 / URAE
[ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".
☆118 · May 3, 2025 · Updated last year
basiclab / Unraveling-Information-Mix-ups
🔥 [NeurIPS 2024] A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in Text-to-Image Encoders through Causal Analysis and Embed…
☆15 · Jun 21, 2025 · Updated last year
Diffusion-CoT / ReflectionFlow
[ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning
☆220 · Nov 5, 2025 · Updated 8 months ago
TIGER-AI-Lab / VIEScore
Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…
☆68 · Nov 19, 2024 · Updated last year
ali-vilab / CDT
Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach
☆17 · Apr 2, 2025 · Updated last year
hxixixh / amo-release
Official implementation for CVPR 2025 paper "AMO Sampler: Enhancing Text Rendering with Overshooting"
☆30 · May 3, 2025 · Updated last year
kongdai123 / consistency2
☆16 · Jun 14, 2024 · Updated 2 years ago
CodeGoat24 / LiFT
Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.
☆85 · May 4, 2025 · Updated last year