zhenyuw16/CompAgent_code

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhenyuw16/CompAgent_code)

zhenyuw16 / CompAgent_code

Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".

☆18

Alternatives and similar repositories for CompAgent_code

Users that are interested in CompAgent_code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WUyinwei-hah / RRNet
View on GitHub
[CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model
☆48Sep 13, 2024Updated last year
j-min / VPGen
View on GitHub
Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
☆57Jul 25, 2023Updated 2 years ago
18445864529 / MAVIN
View on GitHub
Official PyTorch implementation of paper MAVIN: Multi-Action Video Generation with Diffusion Models via Transition Video Infilling
☆13Oct 5, 2024Updated last year
wfanyue / DPG-T2I-Personalization
View on GitHub
[ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning
☆50Jun 17, 2025Updated last year
camenduru / sliders-colab
View on GitHub
☆32Jan 25, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
apple / ml-space-benchmark
View on GitHub
Code and data for "Does Spatial Cognition Emerge in Frontier Models?"
☆29Apr 18, 2025Updated last year
poppuppy / SAR
View on GitHub
☆34Dec 29, 2025Updated 6 months ago
mlpc-ucsd / OverLayBench
View on GitHub
(NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps
☆27May 4, 2026Updated 2 months ago
alipay / diffusion-model-for-instance-segmentation
View on GitHub
This repository is the code of the paper "DiffusionInst: Diffusion Model for Instance Segmentation".
☆28Jan 3, 2023Updated 3 years ago
camenduru / FreeInit-colab
View on GitHub
☆25Dec 21, 2023Updated 2 years ago
PeterGriffinJin / InstructG2I
View on GitHub
InstructG2I: Synthesizing Images from Multimodal Attributed Graphs (NeurIPs 2024)
☆19Oct 17, 2024Updated last year
Kwai-Klear / AR-GRPO
View on GitHub
Training Autoregressive Image Generation models via Reinforcement Learning
☆53Nov 26, 2025Updated 7 months ago
xiefan-guo / initno
View on GitHub
[CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
☆80Jun 7, 2024Updated 2 years ago
qinghew / StableIdentity
View on GitHub
[TMM 2025] StableIdentity: Inserting Anybody into Anywhere at First Sight 🔥
☆259Dec 26, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Lil-Shake / VA-Pi
View on GitHub
[CVPR 2026] This repository is the code of our paper "VA-Pi: Variational Policy Alignment for Pixel-Aware Autoregressive Generation"
☆15Mar 3, 2026Updated 4 months ago
louisYen / Gen4Gen
View on GitHub
🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
☆110Mar 27, 2026Updated 3 months ago
nipunjindal / diffusers-layout-guidance
View on GitHub
🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".
☆42May 24, 2023Updated 3 years ago
tomtom1103 / compose-and-conquer
View on GitHub
[ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
☆104Jan 18, 2024Updated 2 years ago
Toloka / BestPrompts
View on GitHub
Best Prompts for Text-to-Image Models
☆25Jan 20, 2024Updated 2 years ago
hamsterbacke23 / webcamdisplay-react
View on GitHub
Shows all your connected webcam feeds right in the browser, in draggable and resizable boxes
☆13Jan 27, 2019Updated 7 years ago
lucataco / cog-hunyuanvideo-lora-trainer
View on GitHub
Memory-optimized training scripts for video models based on Diffusers
☆17Jan 3, 2025Updated last year
WuTao-CS / CustomCrafter
View on GitHub
[AAAI 2025] CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
☆52Jan 12, 2025Updated last year
SsGood / DBGAN
View on GitHub
[CVPR2020] Tensorflow implementation for paper ''Distribution-induced Bidirectional Generative Adversarial Network for Graph Representati…
☆31Nov 24, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hi-zhengcheng / vividzoo
View on GitHub
☆39Oct 19, 2024Updated last year
gwang-kim / PersonaCraft
View on GitHub
[ICCV 2025] Pytorch implementation of "PersonaCraft: Personalized Full-Body Image Synthesis for Multiple Identities from Single Reference…
☆50Mar 16, 2025Updated last year
trandangtrungduc / llama-paper-summary
View on GitHub
Code, Resources - Personal project - Llama Paper Summary - October 14, 2024.
☆11Oct 15, 2024Updated last year
sjijon / TeX-templates
View on GitHub
Various TeX templates, including slides and posters.
☆16May 19, 2022Updated 4 years ago
ChCh1999 / RTPB
View on GitHub
Code for our paper `Resistance Training using Prior Bias: toward Unbiased Scene Graph Generation`
☆20Feb 18, 2024Updated 2 years ago
kvablack / LLaVA-server
View on GitHub
☆22Oct 20, 2023Updated 2 years ago
siml3 / RU-Net
View on GitHub
☆11Oct 6, 2022Updated 3 years ago
CharlieDDDD / AISurveyPapers
View on GitHub
Large Visual Language Model(LVLM), Large Language Model(LLM), Multimodal Large Language Model(MLLM), Alignment, Agent, AI System, Survey
☆21Jul 27, 2025Updated 11 months ago
CompVis / attribute-control
View on GitHub
Fine-Grained Subject-Specific Attribute Expression Control in T2I Models
☆136Feb 27, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
GraphPKU / LIFT
View on GitHub
The official implementation of LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning
☆15Mar 14, 2025Updated last year
SherryXTChen / TiNO-Edit
View on GitHub
TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing (CVPR 2024)
☆45Sep 12, 2025Updated 10 months ago
DavideAlidosi / sd-webui-controlnet-animatediff
View on GitHub
WebUI extension for AnimateDiff ControlNet
☆37Oct 5, 2023Updated 2 years ago
Continue7777 / ResNet_iris
View on GitHub
利用resnet_18来对虹膜图像进行模糊清晰二分类
☆10Apr 8, 2018Updated 8 years ago
tsunghan-wu / SLD
View on GitHub
🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)
☆186Apr 9, 2024Updated 2 years ago
TonyLianLong / LLM-groundedDiffusion
View on GitHub
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…
☆482Sep 9, 2024Updated last year
feizc / Ingredients
View on GitHub
Blending Custom Photos with Video Diffusion Transformers
☆50Jan 21, 2025Updated last year