Jiahao000 / MosaicFusion
[IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
☆128 Updated last year
Alternatives and similar repositories for MosaicFusion
Users who are interested in MosaicFusion are comparing it to the repositories listed below.
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings" ☆125 Updated last year
- ☆35 Updated last year
- 1-shot image segmentation using Stable Diffusion ☆141 Updated last year
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea… ☆98 Updated last year
- Official implementation of Add-SD: Rational Generation without Manual Reference. ☆28 Updated last year
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT ☆135 Updated last year
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des… ☆55 Updated 2 months ago
- PyTorch Implementation of Object Recognition as Next Token Prediction [CVPR'24 Highlight] ☆180 Updated 5 months ago
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models ☆131 Updated last year
- ☆32 Updated 3 weeks ago
- Open-vocabulary Object Segmentation with Diffusion Models ☆181 Updated 2 years ago
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs. ☆100 Updated 7 months ago
- Training code for CLIP-FlanT5 ☆30 Updated last year
- ☆178 Updated 2 years ago
- Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024 ☆68 Updated last year
- Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation (ICCV 2023) ☆65 Updated 2 years ago
- ☆56 Updated 6 months ago
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition" ☆108 Updated last year
- Official repository of paper "Subobject-level Image Tokenization" (ICML-25) ☆88 Updated 3 months ago
- [ICLR 2024] Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach Link: https://arxiv.o… ☆83 Updated last year
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models ☆45 Updated last year
- ☆20 Updated 2 years ago
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts" ☆82 Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model ☆42 Updated last year
- Diffusion Models as Data Mining Tools ☆54 Updated 5 months ago
- ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023 ☆128 Updated last year
- Simple script to parallelize download and extract files for SA-1B Dataset. ☆37 Updated 3 months ago
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing ☆111 Updated last year
- ☆112 Updated last year
- (ICLR 2024, CVPR 2024) SparseFormer ☆75 Updated 11 months ago