linhuixiao/OneRef

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/linhuixiao/OneRef)

linhuixiao / OneRef

[NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.

☆32

Alternatives and similar repositories for OneRef

Users that are interested in OneRef are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

linhuixiao / HiVG
View on GitHub
[ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.
☆65Nov 10, 2025Updated 8 months ago
chenwei746 / EEVG
View on GitHub
☆23Aug 20, 2024Updated last year
Dmmm1997 / SimVG
View on GitHub
[NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion
☆103Oct 29, 2025Updated 8 months ago
Mr-Bigworth / MMCA
View on GitHub
Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral)
☆26Jun 11, 2025Updated last year
jcwang0602 / PLVL
View on GitHub
Progressive Language-guided Visual Learning for Multi-Task Visual Grounding
☆13May 9, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
linhuixiao / Awesome-Visual-Grounding
View on GitHub
[TPAMI 2025] Towards Visual Grounding: A Survey
☆322Nov 18, 2025Updated 8 months ago
linhuixiao / CLIP-VG
View on GitHub
[TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.
☆135Nov 10, 2025Updated 8 months ago
Dmmm1997 / C3VG
View on GitHub
[AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints
☆45Jul 2, 2025Updated last year
tobran / ONE-PIC
View on GitHub
☆17Jul 23, 2024Updated 2 years ago
LANMNG / LQVG
View on GitHub
☆32Nov 27, 2025Updated 7 months ago
cv516Buaa / OV-VG
View on GitHub
☆31Mar 25, 2024Updated 2 years ago
callsys / DynRefer
View on GitHub
[CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution
☆59Mar 4, 2025Updated last year
jcwang0602 / MLLMSeg
View on GitHub
MLLMSeg: Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder
☆57Jun 12, 2026Updated last month
yxchng / mask-grounding
View on GitHub
[CVPR2024] Mask Grounding for Referring Image Segmentation
☆29Jul 22, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
caibolun / AVEC-BDS2018
View on GitHub
Multi-modality Hierarchical Recall based on GBDTs for Bipolar Disorder Classification
☆10Jul 12, 2023Updated 3 years ago
dengandong / GroundMoRe
View on GitHub
☆18May 18, 2026Updated 2 months ago
Dmmm1997 / PropVG
View on GitHub
[ICCV2025] PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
☆32Oct 13, 2025Updated 9 months ago
THU-MIG / YOLO-UniOW
View on GitHub
YOLO-UniOW: Efficient Universal Open-World Object Detection
☆188Jan 17, 2025Updated last year
JierunChen / Ref-L4
View on GitHub
Evaluation code for Ref-L4, a new REC benchmark in the LMM era
☆61Dec 28, 2024Updated last year
Li-yachuan / EDMB
View on GitHub
Code of paper "EDMB: Edge Detector with Mamba"
☆18May 29, 2026Updated last month
saikrishna-prathapaneni / LowDINO
View on GitHub
☆12Aug 19, 2023Updated 2 years ago
pipilurj / perceptionGPT
View on GitHub
☆18Aug 7, 2024Updated last year
li-jl16 / LORS
View on GitHub
CVPR2024 highlight.
☆13Oct 10, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Hansxsourse / VRMDiff
View on GitHub
☆11Mar 11, 2025Updated last year
asthanameghna / Relightable-BRDF-NeRF
View on GitHub
We propose to tackle the multiview photometric stereo problem using an extension of Neural Radiance Fields (NeRFs), conditioned on light …
☆11Jan 11, 2023Updated 3 years ago
LeapLabTHU / GSVA
View on GitHub
[CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models
☆166Sep 12, 2024Updated last year
irfan112 / yowov3-multistreaming-inferencing
View on GitHub
A real-time inferencing of multistreaming YOWOv3(Spatio Temporal Action Detection task) using (UCF101-24) dataset. The repo is extension …
☆26May 15, 2026Updated 2 months ago
bcaitech1 / p4-fr-sorry-math-but-love-you
View on GitHub
a math-formula image recognition project which placed at the first place in a competition hosted by NAVER CONNECT boostcamp AI Tech
☆10Dec 16, 2023Updated 2 years ago
lerogo / aaai24_itr_cusa
View on GitHub
Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"
☆55Mar 28, 2024Updated 2 years ago
ylingfeng / Add-SD
View on GitHub
Official implementation of Add-SD: Rational Generation without Manual Reference.
☆28Aug 19, 2024Updated last year
VisionXLab / DVGBench
View on GitHub
[ISPRS2026] DVGBench: Implicit-to-Explicit Visual Grounding Benchmark in UAV Imagery with Large Vision-Language Models
☆30Mar 24, 2026Updated 3 months ago
nianfd / RWKV-VG
View on GitHub
☆10Dec 3, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
MengyuanChen21 / CVPR2023-CMPAE
View on GitHub
[CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception
☆37Jun 17, 2023Updated 3 years ago
NMS05 / Patch-Aligned-Contrastive-Learning
View on GitHub
☆24Jul 8, 2023Updated 3 years ago
MengyuanChen21 / NeurIPS2024-CSP
View on GitHub
[NeurIPS 2024] Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models
☆40Oct 17, 2024Updated last year
xiaoxiaotao / person-detection
View on GitHub
TensorRT person tracking RFBNet300
☆30Mar 5, 2020Updated 6 years ago
EdVince / PiDiNet-NCNN
View on GitHub
PiDiNet running in Android by ncnn
☆15Sep 26, 2021Updated 4 years ago
OpenGVLab / PIIP
View on GitHub
[NeurIPS 2024 Spotlight ⭐️ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)
☆113Aug 5, 2025Updated 11 months ago
Charles-Xie / awesome-described-object-detection
View on GitHub
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…
☆358Nov 6, 2025Updated 8 months ago