showlab/VisorGPT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/showlab/VisorGPT)

showlab / VisorGPT

[NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT

☆138

Alternatives and similar repositories for VisorGPT

Users that are interested in VisorGPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

showlab / BoxDiff
View on GitHub
[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
☆275Nov 12, 2024Updated last year
ryylcc / OWSOL
View on GitHub
☆15Feb 18, 2024Updated 2 years ago
Sierkinhane / ORNet
View on GitHub
[ICCV 2021] Online Refinement of Low-level Feature Based Activation Map for Weakly Supervised Object Localization
☆26Jan 29, 2022Updated 4 years ago
ziqi-jin / CV_JOB_interview_related_file
View on GitHub
CV_JOB_interview_related_file
☆10Jul 3, 2022Updated 4 years ago
CVI-SZU / CLIMS
View on GitHub
[CVPR 2022] CLIMS: Cross Language Image Matching for Weakly Supervised Semantic Segmentation
☆138Jun 7, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
HaozheLiu-ST / MEE
View on GitHub
Combating Mode Collapse via Manifold Entropy Estimation
☆11Apr 21, 2023Updated 3 years ago
Cominclip / BoxDiff-XL
View on GitHub
Extend BoxDiff to SDXL (SDXL-based layout-to-image generation)
☆28May 23, 2024Updated 2 years ago
CVI-SZU / StyleGene
View on GitHub
[CVPR 2023 Highlight] StyleGene: Crossover and Mutation of Region-level Facial Genes for Kinship Face Synthesis
☆43Jun 4, 2023Updated 3 years ago
wmpscc / ArxivDailyOverview
View on GitHub
Automatically download and crop key information from the arxiv daily paper.
☆21Jul 30, 2022Updated 3 years ago
showlab / Show-o
View on GitHub
[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.
☆1,965Jan 8, 2026Updated 6 months ago
aim-uofa / VLModel
View on GitHub
Repo of HawkLlama.
☆16Jan 2, 2025Updated last year
silent-chen / layout-guidance
View on GitHub
[WACV 2024] Training-Free Layout Control with Cross-Attention Guidance
☆267Mar 18, 2024Updated 2 years ago
HaozheLiu-ST / T-GATE
View on GitHub
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
☆418Feb 26, 2025Updated last year
csmliu / pretrained-GANs
View on GitHub
A Survey on Leveraging Pre-trained Generative Adversarial Networks for Image Editing and Restoration
☆17Jul 22, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
CVI-SZU / CCAM
View on GitHub
[CVPR 2022] C2AM: Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentati…
☆201May 4, 2024Updated 2 years ago
showlab / Tune-An-Ellipse
View on GitHub
[CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want
☆14Jan 5, 2025Updated last year
Papple-F / csg
View on GitHub
☆17Aug 8, 2024Updated last year
TencentARC / Mix-of-Show
View on GitHub
NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
☆428May 14, 2024Updated 2 years ago
showlab / ShowAnything
View on GitHub
☆83Aug 1, 2023Updated 2 years ago
Sierkinhane / ICCV2023-Diffusion-Papers
View on GitHub
ICCV2023-Diffusion-Papers
☆108Sep 3, 2023Updated 2 years ago
yuval-alaluf / Attend-and-Excite
View on GitHub
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
☆771Jan 26, 2024Updated 2 years ago
Sierkinhane / TAB
View on GitHub
Think about boundary: Fusing multi-level boundary information for landmark heatmap regression.
☆16Oct 22, 2022Updated 3 years ago
showlab / Long-form-Video-Prior
View on GitHub
☆32May 3, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
HaozheLiu-ST / Point-Beyond-Class
View on GitHub
An Official Implementation for the Paper 'Point Beyond Class: A Benchmark for Weakly Semi-Supervised Abnormality Localization in Chest X-…
☆18Oct 20, 2022Updated 3 years ago
TonyLianLong / LLM-groundedDiffusion
View on GitHub
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…
☆483Sep 9, 2024Updated last year
Attention-Refocusing / attention-refocusing
View on GitHub
☆133Jul 17, 2024Updated 2 years ago
Picsart-AI-Research / PAIR-Diffusion
View on GitHub
[CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor
☆521Apr 2, 2024Updated 2 years ago
LinusWu / TENET_Training
View on GitHub
This is an official pytorch implementation of 'Group-wise Inhibition based Feature Regularization for Robust Classification' (ICCV 2021 a…
☆10Dec 10, 2022Updated 3 years ago
DavidYan2001 / Synthetic2Real-Depth
View on GitHub
[CVPR 2025] Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors
☆21Jun 6, 2025Updated last year
DavidYan2001 / PVChat
View on GitHub
[ICCV 2025] PVChat: Personalized Video Chat with One-Shot Learning
☆17Apr 4, 2026Updated 3 months ago
BizhuWu / MG-MotionLLM
View on GitHub
[CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities
☆59Feb 1, 2026Updated 5 months ago
TonyLianLong / igligen
View on GitHub
Improved Implementation for Training GLIGEN: Open-Set Grounded Text-to-Image Generation
☆46Jun 1, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
UCSC-VLAA / HQ-Edit
View on GitHub
[ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing
☆114Apr 18, 2024Updated 2 years ago
aim-uofa / FreeCustom
View on GitHub
[CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
☆177Sep 1, 2025Updated 10 months ago
G-U-N / Gen-L-Video
View on GitHub
The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".
☆308Oct 19, 2025Updated 9 months ago
aim-uofa / AutoStory
View on GitHub
[IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort
☆149Mar 5, 2026Updated 4 months ago
YangLing0818 / EditWorld
View on GitHub
[ACM Multimedia 2025 Datasets Track] EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
☆142Aug 2, 2025Updated 11 months ago
vision-x-nyu / image-sculpting
View on GitHub
Code release for Image Sculpting: Precise Object Editing with 3D Geometry Control [CVPR 2024]
☆297Mar 4, 2024Updated 2 years ago
frank-xwang / InstanceDiffusion
View on GitHub
[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"
☆614Jun 17, 2025Updated last year