Jiahao000 / MosaicFusionLinks
[IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
☆125Updated 10 months ago
Alternatives and similar repositories for MosaicFusion
Users that are interested in MosaicFusion are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆125Updated last year
- ☆34Updated last year
- 1-shot image segmentation using Stable Diffusion☆141Updated last year
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆27Updated last year
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆98Updated last year
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆136Updated last year
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des…☆56Updated last year
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆108Updated last year
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.☆100Updated 5 months ago
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models☆131Updated last year
- [CVPR'24 Highlight] PyTorch Implementation of Object Recognition as Next Token Prediction☆180Updated 3 months ago
- Open-vocabulary Object Segmentation with Diffusion Models☆181Updated 2 years ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆64Updated last year
- ☆175Updated 2 years ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Updated last year
- ☆111Updated last year
- ☆53Updated 4 months ago
- Official repository of paper "Subobject-level Image Tokenization" (ICML-25)☆80Updated last month
- This is a public repository for Image Clustering Conditioned on Text Criteria (IC|TC)☆91Updated last year
- A curated list of papers and resources for text-to-image evaluation.☆30Updated last year
- "FreeU: Free Lunch in Diffusion U-Net" for Huggingface Diffusers☆101Updated last year
- ☆32Updated last year
- ☆189Updated 3 months ago
- [CVPR 2023 Highlight] Freestyle Layout-to-Image Synthesis☆153Updated 2 years ago
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)☆108Updated last year
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆55Updated last month
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆85Updated last year
- ☆34Updated last year
- Code for our ICLR 2024 paper "PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts"☆78Updated last year
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models☆46Updated last year