[NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.
☆31 · Nov 13, 2025 · Updated 6 months ago
Alternatives and similar repositories for OneRef
Users that are interested in OneRef are comparing it to the libraries listed below.
- Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral) · ☆26 · Jun 11, 2025 · Updated last year
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion · ☆103 · Oct 29, 2025 · Updated 7 months ago
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation · ☆22 · Sep 5, 2025 · Updated 9 months ago
- [TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding. · ☆134 · Nov 10, 2025 · Updated 7 months ago
- [TPAMI 2025] Towards Visual Grounding: A Survey · ☆313 · Nov 18, 2025 · Updated 6 months ago
- ☆31 · Mar 25, 2024 · Updated 2 years ago
- Code of paper "EDMB: Edge Detector with Mamba" · ☆18 · May 29, 2026 · Updated 2 weeks ago
- [CVPR2024] Mask Grounding for Referring Image Segmentation · ☆29 · Jul 22, 2024 · Updated last year
- ☆18 · May 18, 2026 · Updated 3 weeks ago
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints · ☆45 · Jul 2, 2025 · Updated 11 months ago
- A simple visual test-time scaling method for GUI agent grounding · ☆26 · Dec 7, 2025 · Updated 6 months ago
- ☆18 · Aug 7, 2024 · Updated last year
- [SENSORS 2025] PicoSAM2 and PicoSAM3 are segmentation models running in-sensor on the Sony IMX500. · ☆45 · Apr 27, 2026 · Updated last month
- The code for paper "Understanding Negative Proposals in Generic Few-Shot Object Detection" · ☆17 · Nov 21, 2023 · Updated 2 years ago
- CVPR2024 highlight. · ☆13 · Oct 10, 2024 · Updated last year
- MLLMSeg: Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder · ☆55 · Jun 2, 2026 · Updated last week
- ☆11 · Mar 11, 2025 · Updated last year
- Official implementation of Add-SD: Rational Generation without Manual Reference. · ☆28 · Aug 19, 2024 · Updated last year
- [ECCV 2024] ControlCap: Controllable Region-level Captioning · ☆81 · Oct 25, 2024 · Updated last year
- ☆10 · Dec 3, 2024 · Updated last year
- Transactions on Multimedia (TMM25) · ☆21 · Apr 8, 2025 · Updated last year
- Official implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow" · ☆32 · Apr 20, 2025 · Updated last year
- ☆28 · Feb 13, 2026 · Updated 3 months ago
- ☆24 · Jul 8, 2023 · Updated 2 years ago
- A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E… · ☆354 · Nov 6, 2025 · Updated 7 months ago
- Evaluation code for Ref-L4, a new REC benchmark in the LMM era · ☆61 · Dec 28, 2024 · Updated last year
- Framework for computationally efficient training of universal image feature extraction models. · ☆21 · Aug 19, 2024 · Updated last year
- Emergent Visual Grounding in Large Multimodal Models Without Grounding Supervision · ☆46 · Oct 19, 2025 · Updated 7 months ago
- ☆54 · Dec 23, 2024 · Updated last year
- [ICLR 2024 Spotlight] R-EDL: Relaxing Nonessential Settings of Evidential Deep Learning · ☆77 · Nov 18, 2024 · Updated last year
- Tiny Object Detection in Remote Sensing Images Based on Object Reconstruction and Multiple Receptive Field Adaptive Feature Enhancement (… · ☆31 · Feb 10, 2026 · Updated 4 months ago
- ☆18 · Nov 15, 2024 · Updated last year
- A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating… · ☆137 · Mar 20, 2024 · Updated 2 years ago
- Awesome autoregressive vision foundation models · ☆26 · Dec 24, 2024 · Updated last year
- [ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption · ☆105 · Sep 18, 2023 · Updated 2 years ago
- [ECCV2024] FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance · ☆18 · Sep 11, 2024 · Updated last year
- A complete daily plan for studying to become a Google software engineer. · ☆10 · Oct 6, 2016 · Updated 9 years ago
- ☆17 · Feb 23, 2025 · Updated last year
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners" · ☆14 · Dec 1, 2024 · Updated last year