Jiahao000 / MosaicFusion
[IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
☆123Updated 7 months ago
Alternatives and similar repositories for MosaicFusion
Users who are interested in MosaicFusion are comparing it to the repositories listed below.
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆98Updated last year
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆124Updated 9 months ago
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆135Updated last year
- ☆105Updated 11 months ago
- Object Recognition as Next Token Prediction (CVPR 2024 Highlight)☆178Updated last month
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.☆100Updated 2 months ago
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des…☆54Updated 10 months ago
- ☆34Updated last year
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models☆130Updated last year
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation☆90Updated 2 months ago
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆27Updated 9 months ago
- Code release for LayoutDiffuse☆55Updated 2 years ago
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)☆108Updated last year
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆96Updated last year
- ☆171Updated last year
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆124Updated 5 months ago
- "FreeU: Free Lunch in Diffusion U-Net" for Huggingface Diffusers☆99Updated last year
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆51Updated 5 months ago
- ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023☆126Updated last year
- (ICLR 2024, CVPR 2024) SparseFormer☆74Updated 6 months ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆103Updated last year
- Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing (NeurIPS 2023)☆105Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆85Updated 10 months ago
- [CVPR 2023 Highlight] Freestyle Layout-to-Image Synthesis☆153Updated 2 years ago
- This repository is for the first survey on SAM & SAM2 for Videos.☆49Updated last month
- ☆51Updated last month
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆107Updated last year
- Official repository of paper "Subobject-level Image Tokenization"☆72Updated 2 months ago
- Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.☆248Updated 7 months ago
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆46Updated 2 months ago