ucasyjz / VIP
[ACCV 2024 Poster] official code for "VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model"
☆9 Updated 3 months ago
Alternatives and similar repositories for VIP:
Users who are interested in VIP are comparing it to the repositories listed below
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models ☆20 Updated 2 months ago
- [NeurIPS2024] ☆13 Updated last month
- ☆39 Updated last year
- Code for paper "Unsegment Anything by Simulating Deformation" (CVPR 2024) ☆25 Updated 7 months ago
- "Visual Prompt Selection for In-Context Learning Segmentation Framework" ☆10 Updated last month
- Replication in Visual Diffusion Models: A Survey and Outlook ☆26 Updated 5 months ago
- official repo for the paper "EXIF as Language: Learning Cross-Modal Associations Between Images and Camera Metadata" ☆44 Updated last year
- Perceptual Artifacts Localization for Image Synthesis Tasks (ICCV '23) ☆51 Updated last year
- Liquid: Language Models are Scalable Multi-modal Generators ☆60 Updated last month
- Official implementation of Improving Text-guided Object Inpainting with Semantic Pre-inpainting in ECCV 2024 ☆44 Updated last month
- [CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models ☆36 Updated 9 months ago
- The benchmark for "Video Object Segmentation in Panoptic Wild Scenes". ☆12 Updated last year
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation ☆27 Updated last month
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation ☆59 Updated this week
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion ☆47 Updated 8 months ago
- ImaginaryNet: Learning Object Detectors without Real Images and Annotations ☆26 Updated last year
- CVPR-24 | Official codebase for ZONE: Zero-shot InstructiON-guided Local Editing ☆69 Updated last month
- ☆27 Updated 3 months ago
- CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts ☆48 Updated 4 months ago
- ☆19 Updated last year
- [NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples ☆46 Updated 2 months ago
- ICCV2023-Diffusion-Papers ☆109 Updated last year
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)" ☆31 Updated 7 months ago
- [ECCV 2024] Tuning-Free Image Customization with Image and Text Guidance ☆17 Updated 2 months ago
- Text4Seg: Reimagining Image Segmentation as Text Generation ☆40 Updated 2 weeks ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆94 Updated last month
- Official implementation of TagAlign ☆34 Updated last month
- Video Diffusion State Space Models ☆19 Updated 9 months ago
- Official Implementation of VideoDPO ☆37 Updated this week
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆83 Updated 6 months ago