[ACCV 2024 Poster] official code for "VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model"
☆10 · Sep 28, 2024 · Updated last year
Alternatives and similar repositories for VIP
Users who are interested in VIP are comparing it to the repositories listed below
- Keypoint dataset for airplane ☆10 · Dec 28, 2019 · Updated 6 years ago
- [ECCV2022] Official PyTorch implementation of the paper "Outpainting by Queries" ☆51 · Nov 27, 2022 · Updated 3 years ago
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding" ☆194 · May 31, 2024 · Updated last year
- [ICLR 2024] Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach Link: https://arxiv.o… ☆91 · Jan 6, 2026 · Updated 2 months ago
- (ICCV 2025) OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation ☆15 · Oct 11, 2025 · Updated 5 months ago
- [ICML 2025] A plug-and-play training paradigm for accelerated sampling of diffusion models, featuring minimal learnable parameters and tr… ☆16 · Jun 29, 2025 · Updated 8 months ago
- Very Long Natural Scenery Image Prediction by Outpainting, ICCV2019, TensorFlow ☆92 · Feb 2, 2021 · Updated 5 years ago
- ☆29 · Dec 10, 2022 · Updated 3 years ago
- A casual PyTorch implementation of Wide-Context Semantic Image Extrapolation paper ☆15 · Oct 27, 2020 · Updated 5 years ago
- Example scripts for using [my] fine-tuned CLIP models with HuggingFace 🤗 ☆13 · Sep 24, 2024 · Updated last year
- ☆15 · May 13, 2024 · Updated last year
- ☆20 · Feb 3, 2025 · Updated last year
- [USENIX Security'24] REMARK-LLM: A robust and efficient watermarking framework for generative large language models ☆27 · Oct 23, 2024 · Updated last year
- Saliency-guided Visual Attention Modeling. #RSS2022 #SOD #RobotVision ☆31 · Jun 13, 2022 · Updated 3 years ago
- Official code repository for the paper A Large-scale AI-generated Image Inpainting Benchmark ☆15 · Jan 13, 2026 · Updated 2 months ago
- Code repository is for "Federated Composite Optimization", to appear in ICML 2021 ☆12 · May 6, 2022 · Updated 3 years ago
- Implementation of paper Generalised Image Outpainting with UTransformer ☆22 · Mar 26, 2024 · Updated last year
- [NeurIPS 2025] EOC-Bench, an innovative benchmark designed to systematically evaluate object-centric embodied cognition in dynamic egocen… ☆22 · Jun 17, 2025 · Updated 9 months ago
- [ICLR2025] Official code implementation of Video-UTR: Unhackable Temporal Rewarding for Scalable Video MLLMs ☆61 · Feb 27, 2025 · Updated last year
- ☆16 · Jan 10, 2025 · Updated last year
- [CHI24] AI-Assisted In-Context Writing on OHMD During Travels ☆11 · Dec 19, 2024 · Updated last year
- [AAAI 2024] M3SOT: Multi-frame, Multi-field, Multi-space 3D Single Object Tracking ☆15 · Apr 29, 2024 · Updated last year
- [ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization ☆14 · Nov 27, 2024 · Updated last year
- Implementation of the paper Inpainting Holes in Folded Fabric Meshes ☆11 · Aug 18, 2023 · Updated 2 years ago
- All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment ☆19 · Feb 11, 2025 · Updated last year
- ☆10 · Oct 7, 2019 · Updated 6 years ago
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation ☆38 · Nov 21, 2023 · Updated 2 years ago
- [ECCV2020] Robust Tracking against Adversarial Attacks ☆16 · Sep 9, 2021 · Updated 4 years ago
- DMNet for Few-shot Segmentation ☆31 · Nov 10, 2023 · Updated 2 years ago
- ☆14 · Jul 14, 2023 · Updated 2 years ago
- ☆23 · Jan 24, 2026 · Updated last month
- This is the implementation of our paper "HighlightNet: Highlighting LowLight Potential Features for Real-Time UAV Tracking". ☆18 · Mar 14, 2022 · Updated 4 years ago
- [TPAMI 2023] Object Affinity Learning: Towards Annotation-free Instance Segmentation ☆14 · Sep 14, 2023 · Updated 2 years ago
- [ECCV 2024] MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance ☆20 · Jun 18, 2025 · Updated 9 months ago
- The official implementation for Detector Guidance for Multi-Object Text-to-Image Generation (DG) ☆20 · Feb 7, 2024 · Updated 2 years ago
- A framework for change detection using PyTorch ☆169 · Jan 9, 2023 · Updated 3 years ago
- ☆12 · Dec 26, 2021 · Updated 4 years ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection" ☆27 · Jul 10, 2023 · Updated 2 years ago
- [CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want ☆14 · Jan 5, 2025 · Updated last year