Dmmm1997/PropVG

[ICCV2025] PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination

☆32

Alternatives and similar repositories for PropVG

Users who are interested in PropVG are comparing it to the repositories listed below.

Dmmm1997 / MomentSeg
[ECCV2026] MomentSeg: Moment-Centric Sampling for Enhanced Video Pixel Understanding
☆24 · Jun 19, 2026 · Updated last month
Dmmm1997 / DeRIS
[ICCV2025] DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy
☆48 · Nov 21, 2025 · Updated 8 months ago
Dmmm1997 / C3VG
[AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints
☆45 · Jul 2, 2025 · Updated last year
Dmmm1997 / InstanceVG
[TPAMI2025] Improving Generalized Visual Grounding with Instance-aware Joint Learning
☆33 · Apr 28, 2026 · Updated 2 months ago
Dmmm1997 / SimVG
[NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion
☆103 · Oct 29, 2025 · Updated 8 months ago
Dmmm1997 / DenseUAV
「TIP2023」Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments
☆228 · Dec 12, 2025 · Updated 7 months ago
Dmmm1997 / FSRA
「TCSVT2021」A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization
☆124 · Mar 7, 2024 · Updated 2 years ago
pumpkin805 / FALIP
[ECCV2024] FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
☆18 · Sep 11, 2024 · Updated last year
rongfu-dsb / MPG-SAM2
[ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
☆23 · Sep 5, 2025 · Updated 10 months ago
Tangkfan / Awesome-Temporal-Video-Grounding
paper list on Video Moment Retrieval (VMR), or Temporal Video Grounding (TVG), Video Grounding (VG), or Temporal Sentence Grounding in Vi…
☆43 · Dec 27, 2025 · Updated 6 months ago
PinxueGuo / X-Prompt
☆17 · Oct 4, 2024 · Updated last year
dengandong / GroundMoRe
☆18 · May 18, 2026 · Updated 2 months ago
sydai / referring-expression-counting
☆28 · Feb 21, 2025 · Updated last year
Markin-Wang / CAMANet
[IJBHI 2024] This is the official implementation of CAMANet: Class Activation Map Guided Attention Network for Radiology Report Generati…
☆11 · May 14, 2025 · Updated last year
Xuchen-Li / Awesome-Vision-Language-Tracking
A vision-language tracking paper list, articles related to visual language tracking have been documented.
☆46 · Dec 15, 2024 · Updated last year
SooLab / SimCIS
[CVPR2025] Rethinking Query-based Transformer for Continual Image Segmentation
☆50 · Jul 16, 2025 · Updated last year
SHI-Labs / Slow-Fast-Video-Multimodal-LLM
☆29 · Apr 8, 2025 · Updated last year
YuanJiayuuu / SWA-PF
☆31 · Sep 22, 2025 · Updated 9 months ago
chenwei746 / EEVG
☆23 · Aug 20, 2024 · Updated last year
GLUS-video / GLUS
[CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…
☆70 · Jun 23, 2025 · Updated last year
linhuixiao / OneRef
[NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.
☆32 · Nov 13, 2025 · Updated 8 months ago
CASIA-IVA-Lab / MRES
This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…
☆74 · Jun 3, 2024 · Updated 2 years ago
MorningStarOvO / ProAPO
Official PyTorch implementation for paper "ProAPO: Progressively Automatic Prompt Optimization for Visual Classification". The paper is a…
☆33 · Nov 9, 2025 · Updated 8 months ago
Mozhgan91 / LEO
LEO: A powerful Hybrid Multimodal LLM
☆20 · Jan 18, 2025 · Updated last year
LeapLabTHU / GSVA
[CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models
☆166 · Sep 12, 2024 · Updated last year
OpenMICG / AHP
Adapter-Enhanced Hierarchical Cross-Modal Pre-training for Lightweight Medical Report Generation
☆15 · Jan 25, 2025 · Updated last year
XLearning-SCU / 2026-CVPR-BML
[CVPR 2026] PyTorch Code for the paper "Bootstrapping Multi-view Learning for Test-time Noisy Correspondence"
☆15 · Jul 1, 2026 · Updated 2 weeks ago
hustvl / LENS
[AAAI 2026 Oral] LENS: Learning to Segment Anything with Unified Reinforced Reasoning
☆136 · Dec 3, 2025 · Updated 7 months ago
bytedance / DQ-Det
Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation
☆38 · Sep 12, 2023 · Updated 2 years ago
letitiabanana / PnP-OVSS
[CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models
☆18 · Jul 22, 2024 · Updated last year
EliSpectre / MM-Mem
[ACL-26 (main)] From Verbatim to Gist Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video A…
☆39 · Apr 19, 2026 · Updated 3 months ago
ZhenyuLU-Heliodore / CoPRS
Project Page for ICLR'26: CoPRS, offering training overview, inference code, and downloadable links.
☆22 · Mar 17, 2026 · Updated 4 months ago
DeepMed-Lab-ECNU / BiGen
Historical Report Guided Bi-modal Concurrent Learning for Pathology Report Generation
☆15 · Nov 24, 2025 · Updated 7 months ago
FudanCVL / SAAS
[AAAI 2026] Segment Anything Across Shots: A Method and Benchmark
☆29 · Nov 16, 2025 · Updated 8 months ago
Huster-Hq / DADA
[MICCAI 2025 Early Accept] Targeted False Positive Synthesis via Detector-guided Adversarial Diffusion Attacker for Robust Polyp Detectio…
☆15 · Dec 5, 2025 · Updated 7 months ago
junwenxiong / diff_sal
Official implementation of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction
☆29 · May 26, 2024 · Updated 2 years ago
zhoustan / CamSAM2
[NeurIPS 2025] CamSAM2: Segment Anything Accurately in Camouflaged Videos
☆21 · Nov 19, 2025 · Updated 8 months ago
suikei-wang / RESAnything
[NeurIPS 2025] RESAnything: Attribute Prompting for Arbitrary Referring Segmentation
☆19 · May 26, 2026 · Updated last month
Ze-Yang / LGKD
[ICCV 2023] Official implementation for "Label-Guided Knowledge Distillation for Continual Semantic Segmentation on 2D Images and 3D Poin…
☆32 · Jan 26, 2024 · Updated 2 years ago