[AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints
☆44Jul 2, 2025Updated 9 months ago
Alternatives and similar repositories for C3VG
Users that are interested in C3VG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV2025] PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination☆32Oct 13, 2025Updated 6 months ago
- [ICCV2025] DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy☆44Nov 21, 2025Updated 5 months ago
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆103Oct 29, 2025Updated 6 months ago
- Official repository of OS-FPI☆17Dec 22, 2024Updated last year
- 「TIP2023」Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments☆216Dec 12, 2025Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance☆17Sep 11, 2024Updated last year
- [arXiv] Without Paired Labeled Data: End-to-End Self-Supervised Method for Drone-View Geo-Localization☆81Apr 16, 2026Updated 2 weeks ago
- [TCSVT'24] Enhancing Cross-View Geo-Localization with Domain Alignment and Scene Consistency☆32May 7, 2025Updated 11 months ago
- ☆30Mar 12, 2026Updated last month
- This repository contains the dataset link and the code for our paper MCCG: A ConvNeXt-based Multiple-Classifier Method for Cross-view Geo…☆31Apr 9, 2024Updated 2 years ago
- Progressive Language-guided Visual Learning for Multi-Task Visual Grounding☆13May 9, 2025Updated 11 months ago
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation☆22Sep 5, 2025Updated 7 months ago
- UGround: Towards Unified Visual Grounding with Unrolled Transformers☆22Feb 15, 2026Updated 2 months ago
- Open-vocabulary Semantic Segmentation☆33Feb 16, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆18Apr 4, 2025Updated last year
- [TGRS'25] AirSpatialBot: A Spatially-Aware Aerial Agent for Fine-Grained Vehicle Attribute Recognization and Retrieval☆32Jan 6, 2026Updated 3 months ago
- SeCap: Self-Calibrating and Adaptive Prompts for Cross-view Person Re-Identification in Aerial-Ground Networks (CVPR'25)☆26Apr 10, 2026Updated 3 weeks ago
- [TGRS'25] Multilevel Embedding and Alignment Network With Consistency and Invariance Learning for Cross-View Geo-Localization.☆49Feb 27, 2026Updated 2 months ago
- [TPAMI 2025] Towards Visual Grounding: A Survey☆306Nov 18, 2025Updated 5 months ago
- MLLMSeg: Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder☆51Aug 16, 2025Updated 8 months ago
- Offical implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"☆32Apr 20, 2025Updated last year
- Official PyTorch implementation of “MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation”☆18Dec 5, 2024Updated last year
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆63Nov 10, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR 2025] Official implementation of the paper "Show and Tell: Visually Explainable Deep Neural Nets via Spatially-Aware Concept Bottle…☆18Jun 29, 2025Updated 10 months ago
- 【AAAI2025】MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt☆87May 13, 2025Updated 11 months ago
- LEO: A powerful Hybrid Multimodal LLM☆20Jan 18, 2025Updated last year
- This repository is an implementation of the ICCV 2025 paper "LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models …☆27Apr 23, 2026Updated last week
- ☆43Jan 1, 2026Updated 4 months ago
- ☆10Dec 3, 2024Updated last year
- ☆10Jan 6, 2025Updated last year
- PDNet: Toward Better One-Stage Object Detection With Prediction Decoupling, TIP 2022☆11Nov 30, 2022Updated 3 years ago
- [ACMMM 2025] Officially implement of the paper "Seg-Wild: Interactive Segmentation based on 3D Gaussian Splatting for Unconstrained Image…☆20Jul 29, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models☆166Sep 12, 2024Updated last year
- CurriculumLoc for Visual Geo-localization☆16Nov 23, 2023Updated 2 years ago
- [CVPR 2025] Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation☆26Nov 17, 2025Updated 5 months ago
- (2024) The Official Repository of Paper "SISP: A Benchmark Dataset for Fine-grained Ship Instance Segmentation in Panchromatic Satellite …☆14Feb 7, 2024Updated 2 years ago
- Dynamic Selective Network for RGB-D Salient Object Detection☆12Jan 22, 2025Updated last year
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆51Sep 24, 2024Updated last year
- [NeurIPS2025 Spotlight 🔥 ] Official implementation of 🛸 "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Langu…☆273Nov 5, 2025Updated 5 months ago