π₯ [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)
β187Apr 9, 2024Updated last year
Alternatives and similar repositories for SLD
Users that are interested in SLD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"β84May 18, 2024Updated last year
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusiβ¦β481Sep 9, 2024Updated last year
- [ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paperβ168May 7, 2024Updated last year
- [CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimizationβ77Jun 7, 2024Updated last year
- Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".β18Jan 30, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ποΈ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"β110Updated this week
- [CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"β607Jun 17, 2025Updated 9 months ago
- β238Apr 10, 2024Updated last year
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)β1,843Feb 1, 2025Updated last year
- [Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluationβ334Dec 24, 2025Updated 3 months ago
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoningβ13Jun 7, 2025Updated 9 months ago
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignmentβ1,282Jul 17, 2024Updated last year
- π₯ [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospeβ¦β56Jan 22, 2026Updated 2 months ago
- Implicit Style-Content Separation using B-LoRAβ397Nov 14, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- β93Sep 22, 2024Updated last year
- π₯ [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"β26Feb 9, 2025Updated last year
- [WACV 2024] Training-Free Layout Control with Cross-Attention Guidanceβ266Mar 18, 2024Updated 2 years ago
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generationβ299Jul 17, 2024Updated last year
- Code of our paper "A Unified Agentic Framework for Evaluating Conditional Image Generation".β30Jul 22, 2025Updated 8 months ago
- Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Modelβ240May 5, 2025Updated 10 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"β103Jul 5, 2024Updated last year
- β57Apr 30, 2024Updated last year
- T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25)β46Oct 6, 2025Updated 5 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)β81Feb 22, 2024Updated 2 years ago
- PixArt-Ξ±: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesisβ3,284Oct 31, 2024Updated last year
- [ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusionβ275Nov 12, 2024Updated last year
- β279Jul 22, 2024Updated last year
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)β768Jan 26, 2024Updated 2 years ago
- [CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Modelβ772Aug 14, 2024Updated last year
- Codes for ID-Specific Video Customized Diffusionβ462Feb 22, 2024Updated 2 years ago
- [CVPR 2024] Official implementation, Inversion-Free Image Editing with Natural Language"β358May 28, 2024Updated last year
- β10Jun 28, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)β535Sep 8, 2025Updated 6 months ago
- β133Jul 17, 2024Updated last year
- π₯ [NeurIPS 2024] A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in Text-to-Image Encoders through Causal Analysis and Embedβ¦β14Jun 21, 2025Updated 9 months ago
- π€ Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".β42May 24, 2023Updated 2 years ago
- [TIP 2026] LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Modelsβ66Aug 14, 2024Updated last year
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editorβ522Apr 2, 2024Updated last year
- [πICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing!β619May 1, 2025Updated 10 months ago