[NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT
☆137May 4, 2024Updated last year
Alternatives and similar repositories for VisorGPT
Users that are interested in VisorGPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion☆275Nov 12, 2024Updated last year
- ☆15Feb 18, 2024Updated 2 years ago
- [ICCV 2021] Online Refinement of Low-level Feature Based Activation Map for Weakly Supervised Object Localization☆25Jan 29, 2022Updated 4 years ago
- CV_JOB_interview_related_file☆10Jul 3, 2022Updated 3 years ago
- [CVPR 2022] CLIMS: Cross Language Image Matching for Weakly Supervised Semantic Segmentation☆139Jun 7, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Combating Mode Collapse via Manifold Entropy Estimation☆11Apr 21, 2023Updated 2 years ago
- Extend BoxDiff to SDXL (SDXL-based layout-to-image generation)☆27May 23, 2024Updated last year
- [CVPR 2023 Highlight] StyleGene: Crossover and Mutation of Region-level Facial Genes for Kinship Face Synthesis☆43Jun 4, 2023Updated 2 years ago
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,903Jan 8, 2026Updated 2 months ago
- Repo of HawkLlama.☆16Jan 2, 2025Updated last year
- [WACV 2024] Training-Free Layout Control with Cross-Attention Guidance☆266Mar 18, 2024Updated 2 years ago
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆417Feb 26, 2025Updated last year
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆27Feb 10, 2026Updated last month
- A Survey on Leveraging Pre-trained Generative Adversarial Networks for Image Editing and Restoration☆17Jul 22, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [CVPR 2022] C2AM: Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentati…☆200May 4, 2024Updated last year
- ☆17Aug 8, 2024Updated last year
- [CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want☆14Jan 5, 2025Updated last year
- NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models☆428May 14, 2024Updated last year
- ICCV2023-Diffusion-Papers☆108Sep 3, 2023Updated 2 years ago
- ☆83Aug 1, 2023Updated 2 years ago
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆768Jan 26, 2024Updated 2 years ago
- Think about boundary: Fusing multi-level boundary information for landmark heatmap regression.☆16Oct 22, 2022Updated 3 years ago
- ☆32May 3, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…☆481Sep 9, 2024Updated last year
- ☆133Jul 17, 2024Updated last year
- This is an official pytorch implementation of 'Group-wise Inhibition based Feature Regularization for Robust Classification' (ICCV 2021 a…☆10Dec 10, 2022Updated 3 years ago
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor☆522Apr 2, 2024Updated last year
- Improved Implementation for Training GLIGEN: Open-Set Grounded Text-to-Image Generation☆46Jun 1, 2024Updated last year
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆175Sep 1, 2025Updated 6 months ago
- The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".☆307Oct 19, 2025Updated 5 months ago
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆113Apr 18, 2024Updated last year
- Implementation of <Symbolic Graphics Programming with Large Language Models>☆38Sep 14, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort☆149Mar 5, 2026Updated 3 weeks ago
- [ACM Multimedia 2025 Datasets Track] EditWorld: Simulating World Dynamics for Instruction-Following Image Editing☆140Aug 2, 2025Updated 7 months ago
- Code release for Image Sculpting: Precise Object Editing with 3D Geometry Control [CVPR 2024]☆297Mar 4, 2024Updated 2 years ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆52Oct 14, 2024Updated last year
- Official Pytorch Implementation of DenseDiffusion (ICCV 2023)☆504Nov 14, 2023Updated 2 years ago
- [CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"☆607Jun 17, 2025Updated 9 months ago
- This is an official pytorch implementation of 'Effective Presentation Attack Detection Driven by Face Related Task'☆38Mar 2, 2023Updated 3 years ago