[ACCV 2024 Poster] official code for "VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model"
☆10Sep 28, 2024Updated last year
Alternatives and similar repositories for VIP
Users who are interested in VIP are comparing it to the libraries listed below.
- Keypoint dataset for airplane☆10Dec 28, 2019Updated 6 years ago
- [ECCV2022] Official PyTorch implementation of the paper "Outpainting by Queries"☆52Nov 27, 2022Updated 3 years ago
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆196May 31, 2024Updated last year
- [ICLR 2024] Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach Link: https://arxiv.o…☆91Jan 6, 2026Updated 3 months ago
- (ICCV 2025) OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation☆16Oct 11, 2025Updated 6 months ago
- [ICML 2025] A plug-and-play training paradigm for accelerated sampling of diffusion models, featuring minimal learnable parameters and tr…☆16Jun 29, 2025Updated 10 months ago
- Very Long Natural Scenery Image Prediction by Outpainting, ICCV2019, TensorFlow☆93Feb 2, 2021Updated 5 years ago
- ☆30Dec 10, 2022Updated 3 years ago
- A casual PyTorch implementation of Wide-Context Semantic Image Extrapolation paper☆15Oct 27, 2020Updated 5 years ago
- Example scripts for using [my] fine-tuned CLIP models with HuggingFace 🤗☆13Sep 24, 2024Updated last year
- ☆15May 13, 2024Updated last year
- ☆20Feb 3, 2025Updated last year
- Saliency-guided Visual Attention Modeling. #RSS2022 #SOD #RobotVision☆30Jun 13, 2022Updated 3 years ago
- [USENIX Security'24] REMARK-LLM: A robust and efficient watermarking framework for generative large language models☆28Oct 23, 2024Updated last year
- Code repository is for "Federated Composite Optimization", to appear in ICML 2021☆12May 6, 2022Updated 3 years ago
- Implementation of paper Generalised Image Outpainting with UTransformer☆22Mar 26, 2024Updated 2 years ago
- Official code repository for the paper A Large-scale AI-generated Image Inpainting Benchmark☆16Jan 13, 2026Updated 3 months ago
- [ICLR2025] Official code implementation of Video-UTR: Unhackable Temporal Rewarding for Scalable Video MLLMs☆61Feb 27, 2025Updated last year
- [NeurIPS 2025] EOC-Bench, an innovative benchmark designed to systematically evaluate object-centric embodied cognition in dynamic egocen…☆22Jun 17, 2025Updated 10 months ago
- [CHI24] AI-Assisted In-Context Writing on OHMD During Travels☆11Dec 19, 2024Updated last year
- [AAAI 2024] M3SOT: Multi-frame, Multi-field, Multi-space 3D Single Object Tracking☆16Apr 29, 2024Updated 2 years ago
- ☆16Jan 10, 2025Updated last year
- [ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization☆14Nov 27, 2024Updated last year
- Implementation of the paper Inpainting Holes in Folded Fabric Meshes☆11Aug 18, 2023Updated 2 years ago
- All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment☆19Feb 11, 2025Updated last year
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation☆38Nov 21, 2023Updated 2 years ago
- ☆10Oct 7, 2019Updated 6 years ago
- [ECCV2020] Robust Tracking against Adversarial Attacks☆17Sep 9, 2021Updated 4 years ago
- DMNet for Few-shot Segmentation☆32Nov 10, 2023Updated 2 years ago
- ☆14Jul 14, 2023Updated 2 years ago
- ☆23Jan 24, 2026Updated 3 months ago
- This is the implementation of our paper "HighlightNet: Highlighting LowLight Potential Features for Real-Time UAV Tracking".☆18Mar 14, 2022Updated 4 years ago
- [TPAMI 2023] Object Affinity Learning: Towards Annotation-free Instance Segmentation☆14Sep 14, 2023Updated 2 years ago
- [ECCV 2024] MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance☆20Jun 18, 2025Updated 10 months ago
- The official implementation for Detector Guidance for Multi-Object Text-to-Image Generation (DG)☆20Feb 7, 2024Updated 2 years ago
- A framework for change detection using PyTorch☆169Jan 9, 2023Updated 3 years ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Jul 10, 2023Updated 2 years ago
- ☆12Dec 26, 2021Updated 4 years ago
- [CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want☆14Jan 5, 2025Updated last year