Jiahao000 / MosaicFusionLinks
[IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
☆123Updated 8 months ago
Alternatives and similar repositories for MosaicFusion
Users that are interested in MosaicFusion are comparing it to the libraries listed below
Sorting:
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆27Updated 10 months ago
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆125Updated 10 months ago
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆98Updated last year
- ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023☆126Updated last year
- Open-vocabulary Object Segmentation with Diffusion Models☆179Updated last year
- [CVPR 2023 Highlight] Freestyle Layout-to-Image Synthesis☆153Updated 2 years ago
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models☆130Updated last year
- "FreeU: Free Lunch in Diffusion U-Net" for Huggingface Diffusers☆100Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆85Updated 11 months ago
- ☆109Updated last year
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆136Updated last year
- 1-shot image segmentation using Stable Diffusion☆139Updated last year
- Official repository of paper "Subobject-level Image Tokenization" (ICML-25)☆72Updated 2 months ago
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆107Updated last year
- Pytorch code for paper From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models☆199Updated 5 months ago
- Code release for LayoutDiffuse☆55Updated 2 years ago
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation☆91Updated 2 months ago
- [CVPR2024] CapHuman: Capture Your Moments in Parallel Universes☆97Updated 7 months ago
- ☆32Updated last year
- Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing (NeurIPS 2023)☆105Updated last year
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des…☆55Updated 11 months ago
- [ICCV2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models☆176Updated last year
- Official Implementations "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR2024)☆51Updated 6 months ago
- ☆85Updated last year
- ICCV2023-Diffusion-Papers☆108Updated last year
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".☆120Updated this week
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.☆100Updated 3 months ago
- ☆181Updated last month
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024)☆131Updated 2 months ago
- [ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion☆264Updated 7 months ago