ucasyjz / VIPLinks
[ACCV 2024 Poster] official code for "VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model"
☆10Updated last year
Alternatives and similar repositories for VIP
Users that are interested in VIP are comparing it to the libraries listed below
Sorting:
- Perceptual Artifacts Localization for Image Synthesis Tasks (ICCV 23')☆66Updated 2 years ago
- The code of Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks☆24Updated last year
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback☆59Updated last year
- OmniStyle: Filtering High Quality Style Transfer Data at Scale (CVPR 2025)☆34Updated 5 months ago
- ☆11Updated last year
- Official Implementations "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR2024)☆59Updated last year
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion☆52Updated last year
- FreeCond: A Free Lunch for Input Conditions in Text-Guided Inpainting. FreeCond introduces a more generalized form💪 of the original inpa…☆15Updated 8 months ago
- Decoupled Textual Embeddings for Customized Image Generation (AAAI 2024)☆30Updated last year
- The official codes and datasets for Artistic Text Segmentation (ECCV 2024).☆27Updated 4 months ago
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆27Updated last year
- ☆21Updated this week
- ☆19Updated last year
- The official code of "Image is All You Need to Empower Large-scale Diffusion Models for In-Domain Generation". [CVPR2025]☆23Updated 10 months ago
- [CVPR2024] Official implementation of High-fidelity Person-centric Subject-to-Image Synthesis.☆54Updated 11 months ago
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆38Updated last year
- ☆19Updated 2 years ago
- MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills [SIGGRAPH 2025]☆74Updated 2 weeks ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆86Updated last year
- The codes of our paper "ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion"☆14Updated 7 months ago
- Code and dataset for "Detecting Human Artifacts from Text-to-Image Models"☆46Updated last year
- Repo for "Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content"☆38Updated 7 months ago
- Official repo for 【FaceScore: Benchmarking and Enhancing Face Quality in Human Generation】☆81Updated last year
- Official implementation of ImprovingText-guided ObjectInpainting with SemanticPre-inpainting in ECCV 2024☆63Updated last year
- CVPR-24 | Official codebase for ZONE: Zero-shot InstructiON-guided Local Editing☆83Updated last year
- code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models"☆46Updated 2 years ago
- [ICCV 2023] The datasets and code used in our paper "Foreground Object Search by Distilling Composite Image Feature", ICCV2023.☆22Updated last week
- ☆22Updated 2 years ago
- ☆41Updated last year
- Code Implementation of "Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model"☆130Updated 11 months ago