[NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.
☆31Nov 13, 2025Updated 4 months ago
Alternatives and similar repositories for OneRef
Users that are interested in OneRef are comparing it to the libraries listed below
Sorting:
- Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral)☆26Jun 11, 2025Updated 9 months ago
- ☆23Aug 20, 2024Updated last year
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆101Oct 29, 2025Updated 4 months ago
- Progressive Language-guided Visual Learning for Multi-Task Visual Grounding☆13May 9, 2025Updated 10 months ago
- [TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.☆132Nov 10, 2025Updated 4 months ago
- [TPAMI 2025] Towards Visual Grounding: A Survey☆297Nov 18, 2025Updated 4 months ago
- ☆32Mar 25, 2024Updated last year
- Code of paper "EDMB: Edge Detector with Mamba"☆17May 6, 2025Updated 10 months ago
- [IEEE SENSORS 2025/26] PicoSAM2 and PicoSAM3 are in-sensor segmentation models compatible with the Sony IMX500☆28Mar 13, 2026Updated last week
- ☆28Jul 22, 2024Updated last year
- ☆16Apr 4, 2025Updated 11 months ago
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆44Jul 2, 2025Updated 8 months ago
- ☆12Aug 19, 2023Updated 2 years ago
- A simple visual test-time scaling method for GUI agent grounding☆21Dec 7, 2025Updated 3 months ago
- We propose to tackle the multiview photometric stereo problem using an extension of Neural Radiance Fields (NeRFs), conditioned on light …☆11Jan 11, 2023Updated 3 years ago
- ☆17Aug 7, 2024Updated last year
- MLLMSeg: Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder☆51Aug 16, 2025Updated 7 months ago
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆28Aug 19, 2024Updated last year
- Transactions on Multimedia (TMM25)☆19Apr 8, 2025Updated 11 months ago
- ☆24Jul 8, 2023Updated 2 years ago
- ☆26Feb 13, 2026Updated last month
- [ICLR 2024 Spotlight] R-EDL: Relaxing Nonessential Settings of Evidential Deep Learning☆43Nov 18, 2024Updated last year
- [CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception☆37Jun 17, 2023Updated 2 years ago
- A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…☆345Nov 6, 2025Updated 4 months ago
- Framework for computationally efficient training of universal image feature extraction models.☆21Aug 19, 2024Updated last year
- ☆53Dec 23, 2024Updated last year
- Evaluation code for Ref-L4, a new REC benchmark in the LMM era☆60Dec 28, 2024Updated last year
- This is the project for 'USG'.☆37Apr 7, 2025Updated 11 months ago
- This repository contains the annotations used for evaluating Unsupervised Domain Adaptation on EPIC Kitchens, with individual kitchens us…☆13Jun 2, 2020Updated 5 years ago
- Tiny Object Detection in Remote Sensing Images Based on Object Reconstruction and Multiple Receptive Field Adaptive Feature Enhancement (…☆30Feb 10, 2026Updated last month
- A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating…☆137Mar 20, 2024Updated 2 years ago
- ☆18Nov 15, 2024Updated last year
- [ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption☆105Sep 18, 2023Updated 2 years ago
- [ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance☆17Sep 11, 2024Updated last year
- ☆11Apr 4, 2025Updated 11 months ago
- [ECCV 2022] Dual-Evidential Learning for Weakly-supervised Temporal Action Localization☆49Apr 19, 2024Updated last year
- ☆13Feb 12, 2024Updated 2 years ago
- ☆16Feb 23, 2025Updated last year
- M-SpecGene: Generalized Foundation Model for RGBT Multispectral Vision (ICCV 2025)☆30Nov 19, 2025Updated 4 months ago