[NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.
☆30Nov 13, 2025Updated 3 months ago
Alternatives and similar repositories for OneRef
Users that are interested in OneRef are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation☆20Sep 5, 2025Updated 5 months ago
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆100Oct 29, 2025Updated 4 months ago
- ☆23Aug 20, 2024Updated last year
- [TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.☆132Nov 10, 2025Updated 3 months ago
- ☆12Aug 19, 2023Updated 2 years ago
- ☆11Mar 11, 2025Updated 11 months ago
- [CVPR 2023] Cascade Evidential Learning for Open-world Weakly-supervised Temporal Action Localization☆12Jul 9, 2024Updated last year
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution☆58Mar 4, 2025Updated 11 months ago
- ☆28Jul 22, 2024Updated last year
- ☆32Mar 25, 2024Updated last year
- Code of paper "EDMB: Edge Detector with Mamba"☆17May 6, 2025Updated 9 months ago
- We propose to tackle the multiview photometric stereo problem using an extension of Neural Radiance Fields (NeRFs), conditioned on light …☆11Jan 11, 2023Updated 3 years ago
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆44Jul 2, 2025Updated 8 months ago
- ☆16Apr 4, 2025Updated 10 months ago
- Transactions on Multimedia (TMM25)☆19Apr 8, 2025Updated 10 months ago
- ☆17Aug 7, 2024Updated last year
- ☆18Nov 15, 2024Updated last year
- Framework for computationally efficient training of universal image feature extraction models.☆21Aug 19, 2024Updated last year
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆80Oct 25, 2024Updated last year
- Emergent Visual Grounding in Large Multimodal Models Without Grounding Supervision☆42Oct 19, 2025Updated 4 months ago
- ☆53Dec 23, 2024Updated last year
- Awesome autoregressive vision foundation models☆26Dec 24, 2024Updated last year
- This is the project for 'USG'.☆36Apr 7, 2025Updated 10 months ago
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing☆25Dec 8, 2024Updated last year
- Tiny Object Detection in Remote Sensing Images Based on Object Reconstruction and Multiple Receptive Field Adaptive Feature Enhancement (…☆30Feb 10, 2026Updated 2 weeks ago
- [ECCV 2024] Surface-Centric Modeling for High-Fidelity Generalizable Neural Surface Reconstruction☆24Sep 22, 2024Updated last year
- YOLO-UniOW: Efficient Universal Open-World Object Detection☆175Jan 17, 2025Updated last year
- [TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆58Dec 22, 2025Updated 2 months ago
- A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…☆342Nov 6, 2025Updated 3 months ago
- ☆59Sep 14, 2024Updated last year
- Code for the paper entitled "Towards Driving-Oriented Metric for Lane Detection Models" (CVPR 2022)☆25Mar 19, 2022Updated 3 years ago
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆28Aug 19, 2024Updated last year
- [ECCV 2024] The official PyTorch implementation of the "Plain-Det: A Plain Multi-Dataset Object Detector".☆30Dec 8, 2024Updated last year
- Offical implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"☆31Apr 20, 2025Updated 10 months ago
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…☆12Oct 14, 2024Updated last year
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything☆70Apr 7, 2024Updated last year
- ☆33Sep 27, 2024Updated last year
- [CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection☆29Sep 26, 2024Updated last year
- [ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation☆31Dec 4, 2024Updated last year