Jiahao000 / MosaicFusion
[IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
☆119Updated 3 months ago
Alternatives and similar repositories for MosaicFusion:
Users that are interested in MosaicFusion are comparing it to the libraries listed below
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆135Updated 8 months ago
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆118Updated 5 months ago
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆27Updated 5 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 6 months ago
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆97Updated 8 months ago
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆75Updated this week
- Object Recognition as Next Token Prediction (CVPR 2024 Highlight)☆170Updated last month
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.☆90Updated 10 months ago
- ☆163Updated last year
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆90Updated 9 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆61Updated 8 months ago
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆104Updated 8 months ago
- Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'☆102Updated 2 months ago
- ICCV2023-Diffusion-Papers☆109Updated last year
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆84Updated 9 months ago
- ☆34Updated last year
- "FreeU: Free Lunch in Diffusion U-Net" for Huggingface Diffusers☆98Updated last year
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)☆105Updated last year
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆45Updated 3 months ago
- ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023☆122Updated last year
- Official repo: SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing☆51Updated 9 months ago
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆118Updated last month
- Code release for LayoutDiffuse☆52Updated last year
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".☆116Updated last week
- [ICLR2025]☆131Updated this week
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆74Updated last year
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆93Updated 10 months ago
- Vico: Compositional Video Generation as Flow Equalization☆56Updated 2 months ago
- ☆47Updated last month
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆39Updated 3 weeks ago