hwjiang1510 / VQLoCLinks

(NeurIPS 2023) Open-set visual object query search & localization in long-form videos

☆24

Alternatives and similar repositories for VQLoC

Users that are interested in VQLoC are comparing it to the libraries listed below

Sorting:

sapeirone / EgoPack
Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…
☆24Updated last year
clownrat6 / OpenVIS
[AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.
☆23Updated 7 months ago
Becomebright / ReKV
Official PyTorch Code of ReKV (ICLR'25)
☆36Updated 4 months ago
franciszzj / OpenPSG
[ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models
☆47Updated 7 months ago
Ranking-VMR / SPR
☆11Updated 6 months ago
Becomebright / GroundVQA
Official PyTorch code of GroundVQA (CVPR'24)
☆61Updated 10 months ago
jh-yi / Video-Panda
Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models [CVPR 2025]
☆72Updated last month
guanw-pku / OED
Official implementation of paper "OED: Towards One-stage End-to-End Dynamic Scene Graph Generation".
☆20Updated last year
ut-vision / ActionVOS
[ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation
☆33Updated 8 months ago
GLUS-video / GLUS
[CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…
☆48Updated last month
gpt4vision / OvSGTR
[ECCV 2024 Best Paper Candidate] Implementation of "Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Vi…
☆74Updated last week
EdenGabriel / TaskWeave
[CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection
☆26Updated 10 months ago
Show-han / Zeroshot_REC
Official code for Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions (CVPR 2024)
☆26Updated last year
haochenheheda / LVVIS
Large-Vocabulary Video Instance Segmentation dataset
☆90Updated last year
zhousheng97 / EgoTextVQA
[CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
☆36Updated last month
ExplainableML / EgoCVR
[ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval
☆39Updated 3 months ago
ChocoWu / USG
This is the project for 'USG'.
☆22Updated 4 months ago
shvdiwnkozbw / SSL-UVOS
[ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation
☆35Updated 5 months ago
Tanveer81 / ReVisionLLM
This is the official implementation of ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos
☆27Updated last month
zhengrongz / AoTD
[CVPR 2025] Official PyTorch code of "Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation".
☆40Updated 2 months ago
baopj / E3M
[ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.
☆11Updated last year
hrtang22 / MUSE
Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"
☆21Updated 6 months ago
qirui-chen / MultiHop-EgoQA
[AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos
☆26Updated 2 months ago
kwonjunn01 / Hi-Mapper
☆14Updated 8 months ago
cilinyan / ReVOS-api
[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model
☆17Updated last year
OpenGVLab / TimeSuite
[ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning
☆40Updated 4 months ago
RobertLuo1 / NeurIPS2023_SOC
[NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation
☆32Updated last year
z-x-yang / DoraemonGPT
Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models
☆86Updated 11 months ago
ailab-kyunghee / CM2_DVC
[CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval
☆60Updated last year
minjoong507 / BM-DETR
[WACV 2025] Official Pytorch code for "Background-aware Moment Detection for Video Moment Retrieval"
☆16Updated 5 months ago