[ICCV2025] PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
☆32Oct 13, 2025Updated 4 months ago
Alternatives and similar repositories for PropVG
Users that are interested in PropVG are comparing it to the libraries listed below
- [ICCV2025] DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy☆41Nov 21, 2025Updated 3 months ago
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆44Jul 2, 2025Updated 7 months ago
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆100Oct 29, 2025Updated 4 months ago
- [PR2026] Drone Referring Localization: An Efficient Heterogeneous Spatial Feature Interaction Method For UAV Self-Localization☆85Feb 19, 2026Updated last week
- [ICCVW2025] V-RoAst: A New Dataset for Visual Road Assessment☆11Dec 17, 2025Updated 2 months ago
- Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"☆28Jan 13, 2026Updated last month
- Bird's Eye View Calibration Toolkit☆17Jun 21, 2025Updated 8 months ago
- Code of the paper "Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation"…☆17Nov 11, 2025Updated 3 months ago
- FunASR Android on-device offline version with full 2pass mode☆14Sep 4, 2023Updated 2 years ago
- Multimodal paper reading (Visual, Audio)☆21Nov 24, 2025Updated 3 months ago
- Rank 9, IJCAI-18 Alimama Search Ad Conversion Prediction, Season 1☆10Aug 22, 2018Updated 7 years ago
- GLCONet: Learning Multisource Perception Representation for Camouflaged Object Detection (TNNLS, 2024)☆16Jul 10, 2025Updated 7 months ago
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation☆20Sep 5, 2025Updated 5 months ago
- Official repository of DriveWorld-VLA☆26Feb 1, 2026Updated 3 weeks ago
- [AAAI 2026 Oral] LENS: Learning to Segment Anything with Unified Reinforced Reasoning☆106Dec 3, 2025Updated 2 months ago
- A simple, elegant web tool that allows you to create custom RSS feeds for arXiv search queries. Stay up-to-date with the latest research …☆33Dec 5, 2025Updated 2 months ago
- Includes the code for training and testing the CountGD++ model from the paper CountGD++: Generalized Prompting for Open-World Counting.☆30Feb 21, 2026Updated last week
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆15Mar 11, 2025Updated 11 months ago
- Large models for transportation☆21Jul 9, 2025Updated 7 months ago
- Official repository of OS-FPI☆16Dec 22, 2024Updated last year
- [NeurIPS 2025] TrajAgent: An LLM-Agent Framework for Trajectory Modeling via Large-and-Small Model Collaboration☆19Nov 30, 2025Updated 3 months ago
- [ECCV2024] FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance☆17Sep 11, 2024Updated last year
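- Data analysis and processing practice (including basic data preprocessing operations and implementations of basic machine learning algorithms)☆17Aug 23, 2018Updated 7 years ago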
- Resources for few-shot reasoning tutorial☆15Oct 16, 2023Updated 2 years ago
- 4D-ROLLS: 4D Radar Occupancy Learning via Lidar Supervision☆15Mar 1, 2025Updated 11 months ago
- This is the official implementation of "OpenREAD: Reinforced Open-Ended Reasoning for End-to-End Autonomous Driving with LLM-as-Critic"☆41Dec 11, 2025Updated 2 months ago
- [ACMMM 23] Official implementation of Object Segmentation by Mining Cross-Modal Semantics (First Uniformed model for SOD and/or COD with …☆16Sep 15, 2023Updated 2 years ago
- The dataset includes large quantities of vehicle trajectories at several sites. The dataset is extracted from aerial videos. Human work is used to…☆36Oct 8, 2025Updated 4 months ago
- Hyper-networks for Unified Visual Representation (HUVR) use implicit neural representation to bridge the gap between understanding and ge…☆24Jan 23, 2026Updated last month
- Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models☆21Apr 16, 2025Updated 10 months ago