Dmmm1997/C3VG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Dmmm1997/C3VG)

Dmmm1997 / C3VG

[AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints

☆45

Alternatives and similar repositories for C3VG

Users that are interested in C3VG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Dmmm1997 / MomentSeg
View on GitHub
[ECCV2026] MomentSeg: Moment-Centric Sampling for Enhanced Video Pixel Understanding
☆24Jun 19, 2026Updated last month
Dmmm1997 / PropVG
View on GitHub
[ICCV2025] PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
☆32Oct 13, 2025Updated 9 months ago
Dmmm1997 / DeRIS
View on GitHub
[ICCV2025] DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy
☆48Nov 21, 2025Updated 8 months ago
Dmmm1997 / InstanceVG
View on GitHub
[TPAMI2025] Improving Generalized Visual Grounding with Instance-aware Joint Learning
☆33Apr 28, 2026Updated 2 months ago
Dmmm1997 / DRL
View on GitHub
[PR2026] Drone Referring Localization: An Efficient Heterogeneous Spatial Feature Interaction Method For UAV Self-Localization
☆94Feb 19, 2026Updated 5 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Dmmm1997 / SimVG
View on GitHub
[NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion
☆103Oct 29, 2025Updated 8 months ago
cjhing / OS-FPI
View on GitHub
Official repository of OS-FPI
☆17Dec 22, 2024Updated last year
Dmmm1997 / DenseUAV
View on GitHub
「TIP2023」Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments
☆228Dec 12, 2025Updated 7 months ago
pumpkin805 / FALIP
View on GitHub
[ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
☆18Sep 11, 2024Updated last year
chenwei746 / EEVG
View on GitHub
☆23Aug 20, 2024Updated last year
rongfu-dsb / MPG-SAM2
View on GitHub
[ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
☆23Sep 5, 2025Updated 10 months ago
Shuyu-Hu / QDFL
View on GitHub
☆15Apr 15, 2025Updated last year
LeapLabTHU / GSVA
View on GitHub
[CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models
☆166Sep 12, 2024Updated last year
linhuixiao / OneRef
View on GitHub
[NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.
☆32Nov 13, 2025Updated 8 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
linhuixiao / HiVG
View on GitHub
[ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.
☆65Nov 10, 2025Updated 8 months ago
YuanJiayuuu / SWA-PF
View on GitHub
☆31Sep 22, 2025Updated 9 months ago
jcwang0602 / MLLMSeg
View on GitHub
MLLMSeg: Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder
☆56Jun 12, 2026Updated last month
Tangkfan / Awesome-Temporal-Video-Grounding
View on GitHub
paper list on Video Moment Retrieval (VMR), or Temporal Video Grounding (TVG), Video Grounding (VG), or Temporal Sentence Grounding in Vi…
☆43Dec 27, 2025Updated 6 months ago
yifeisu / TG-GAT
View on GitHub
Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation, AVDN Challenge, ICCV CLVL 2023.
☆21Jan 2, 2024Updated 2 years ago
jcwang0602 / PLVL
View on GitHub
Progressive Language-guided Visual Learning for Multi-Task Visual Grounding
☆13May 9, 2025Updated last year
linhuixiao / Awesome-Visual-Grounding
View on GitHub
[TPAMI 2025] Towards Visual Grounding: A Survey
☆322Nov 18, 2025Updated 8 months ago
CongHan0808 / DeOP
View on GitHub
Open-vocabulary Semantic Segmentation
☆33Feb 16, 2024Updated 2 years ago
yxchng / mask-grounding
View on GitHub
[CVPR2024] Mask Grounding for Referring Image Segmentation
☆29Jul 22, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
dengandong / GroundMoRe
View on GitHub
☆18May 18, 2026Updated 2 months ago
sydai / referring-expression-counting
View on GitHub
☆28Feb 21, 2025Updated last year
hanghuacs / FineCaption
View on GitHub
☆39Jun 20, 2025Updated last year
mvrl / GOMAA-Geo
View on GitHub
[NeurIPS'24] PyTorch implementation of GOMAA-Geo: GOal Modality Agnostic Active Geo-localization
☆37Oct 2, 2024Updated last year
xjwu1024 / WPS-SAM
View on GitHub
Official PyTorch implementation of WPS from our paper: WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models
☆14Jun 12, 2025Updated last year
pipilurj / perceptionGPT
View on GitHub
☆18Aug 7, 2024Updated last year
LiBingyu01 / FGA-seg
View on GitHub
Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation
☆16Mar 28, 2026Updated 3 months ago
nnnth / UFO
View on GitHub
[NeurIPS2025 Spotlight 🔥 ] Official implementation of 🛸 "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Langu…
☆280Nov 5, 2025Updated 8 months ago
LANMNG / LQVG
View on GitHub
☆32Nov 27, 2025Updated 7 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Mabel0403 / CAMP
View on GitHub
[🎉IEEE TGRS'24] The official code for paper "CAMP: A Cross-View Geo-Localization Method using Contrastive Attributes Mining and Position…
☆35Jul 11, 2025Updated last year
TalonX1 / ProxyRecon
View on GitHub
☆10Jun 4, 2024Updated 2 years ago
guobaoxiao / DSAM
View on GitHub
Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection, ACM Multimedia (MM), 2024
☆25Oct 15, 2024Updated last year
zhanghr2001 / VCoT-Grasp
View on GitHub
Official repository for VCoT-Grasp.
☆22Nov 18, 2025Updated 8 months ago
Shaosifan / FIANet
View on GitHub
[IEEE TGRS 2025] Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image Segmentation
☆45Nov 14, 2025Updated 8 months ago
SHI-Labs / Slow-Fast-Video-Multimodal-LLM
View on GitHub
☆29Apr 8, 2025Updated last year
jiaqihuang01 / DETRIS
View on GitHub
[AAAI-2025] The official code of Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation
☆74May 21, 2025Updated last year