[ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
☆104Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for compose-and-conquer
Users that are interested in compose-and-conquer are comparing it to the libraries listed below
Sorting:
- Official code release for the paper Trapped in texture bias? A large scale comparison of deep instance segmentation, accepted at ECCV 202…☆16Jan 16, 2024Updated 2 years ago
- [NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"☆350Mar 16, 2025Updated 11 months ago
- ☆35Jan 23, 2024Updated 2 years ago
- PyTorch implementation of InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following☆31Jan 24, 2025Updated last year
- ☆105Sep 4, 2024Updated last year
- ☆26Jul 17, 2025Updated 7 months ago
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing☆19Dec 28, 2024Updated last year
- Lifting ControlNet for Generalized Depth Conditioning☆483Dec 7, 2023Updated 2 years ago
- [ECAI 2023] MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient☆32Dec 8, 2023Updated 2 years ago
- [NeurIPS 2025] Official code for ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation☆33Oct 17, 2025Updated 4 months ago
- [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models☆120Nov 14, 2024Updated last year
- ☆54Sep 27, 2024Updated last year
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024☆758Nov 16, 2023Updated 2 years ago
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".☆298Aug 29, 2025Updated 6 months ago
- repository for 360 panorama image generation based on Stable Diffusion☆312May 20, 2024Updated last year
- Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection☆22Feb 5, 2026Updated last month
- [ECCV2024] ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation☆53Mar 28, 2025Updated 11 months ago
- [AAAI'2024] IT3D: Improved Text-to-3D Generation with Explicit View Synthesis☆220Dec 13, 2023Updated 2 years ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,844Feb 1, 2025Updated last year
- ☆94Apr 21, 2025Updated 10 months ago
- [SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images☆506Oct 7, 2025Updated 4 months ago
- Self-supervised Learning to Bring Dual Reversed Rolling Shutter Images Alive (ICCV2023)☆15Jul 6, 2024Updated last year
- [Pattern Recognition 2024] Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models, Dong Li, Jiandon…☆18Jan 18, 2025Updated last year
- [ICLR 2024] Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping☆84Jan 18, 2024Updated 2 years ago
- [NeurIPS 2023] Official implementation of SyncDiffusion☆169Apr 20, 2024Updated last year
- Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Con…☆478Oct 21, 2024Updated last year
- This repository holds the "Fully automated landmarking and facial segmentation on 3D photographs" files☆30Oct 23, 2023Updated 2 years ago
- Codes for ID-Specific Video Customized Diffusion☆462Feb 22, 2024Updated 2 years ago
- Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation"☆243Mar 20, 2024Updated last year
- CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control☆172Dec 2, 2024Updated last year
- [CVPR 2024] Official implementation of FreeDrag: Feature Dragging for Reliable Point-based Image Editing☆422Apr 13, 2025Updated 10 months ago
- Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024☆354Sep 24, 2024Updated last year
- Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".☆18Jan 30, 2024Updated 2 years ago
- ☆17Nov 10, 2023Updated 2 years ago
- Official implementation of StochSync: a zero-shot approach for image generation in arbitrary spaces via stochastic diffusion synchronizat…☆21Jun 24, 2025Updated 8 months ago
- Code release for Image Sculpting: Precise Object Editing with 3D Geometry Control [CVPR 2024]☆298Mar 4, 2024Updated 2 years ago
- [ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.☆510Mar 7, 2024Updated last year
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆35Nov 25, 2024Updated last year
- [TMM 2025] StableIdentity: Inserting Anybody into Anywhere at First Sight 🔥☆260Dec 26, 2024Updated last year