EasonXiao-888/UVCOM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/EasonXiao-888/UVCOM)

EasonXiao-888 / UVCOM

[CVPR 2024] Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection

☆117

Alternatives and similar repositories for UVCOM

Users that are interested in UVCOM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wjun0830 / CGDETR
View on GitHub
Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr…
☆154Aug 21, 2024Updated last year
EdenGabriel / TaskWeave
View on GitHub
[CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection
☆30Sep 26, 2024Updated last year
mingyao1120 / TR-DETR
View on GitHub
Official pytorch repository for "TR-DETR: Task-Reciprocal Transformer for Joint Moment Retrieval and Highlight Detection" (AAAI 2024 Pape…
☆57Feb 22, 2025Updated last year
wjun0830 / QD-DETR
View on GitHub
Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 …
☆251Aug 12, 2025Updated 11 months ago
sudo-Boris / mr-Blip
View on GitHub
Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"
☆95Mar 9, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
linjieli222 / HERO_Video_Feature_Extractor
View on GitHub
Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
☆118Jun 9, 2021Updated 5 years ago
RobertLuo1 / NeurIPS2023_SOC
View on GitHub
[NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation
☆33Mar 16, 2024Updated 2 years ago
jayleicn / moment_detr
View on GitHub
[NeurIPS 2021] Moment-DETR code and QVHighlights dataset
☆349Mar 9, 2026Updated 4 months ago
TencentARC / UMT
View on GitHub
UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or …
☆238Apr 15, 2024Updated 2 years ago
Zhuo-Cao / FlashVTG
View on GitHub
FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)
☆39Apr 17, 2025Updated last year
jinhyunj / EaTR
View on GitHub
Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)
☆55Sep 7, 2023Updated 2 years ago
showlab / UniVTG
View on GitHub
[ICCV 2023] UniVTG: Towards Unified Video-Language Temporal Grounding
☆380May 8, 2024Updated 2 years ago
dpaul06 / VideoLights
View on GitHub
☆17Dec 4, 2024Updated last year
ailab-kyunghee / CM2_DVC
View on GitHub
[CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval
☆66Jun 19, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
houzhijian / CONE
View on GitHub
[2023 ACL] CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding
☆31Aug 5, 2023Updated 2 years ago
line / lighthouse
View on GitHub
[EMNLP2024 Demo], [ICASSP 2025], [ICASSP 2026] A user-friendly library for reproducible video moment retrieval and highlight detection. I…
☆262Mar 26, 2026Updated 4 months ago
HengLan / CGSTVG
View on GitHub
[CVPR 2024] Context-Guided Spatio-Temporal Video Grounding
☆66Jun 28, 2024Updated 2 years ago
HuiGuanLab / RaTSG
View on GitHub
This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"
☆13Aug 22, 2025Updated 11 months ago
lntzm / MESM
View on GitHub
The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)
☆32Mar 29, 2024Updated 2 years ago
afcedf / SOONet
View on GitHub
Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos
☆30Jun 24, 2024Updated 2 years ago
minghangz / SPL
View on GitHub
Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization
☆16Jul 20, 2023Updated 3 years ago
RobertLuo1 / CoHD
View on GitHub
The official implementation of A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation
☆27Aug 17, 2025Updated 11 months ago
yeliudev / R2-Tuning
View on GitHub
🌀 R2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)
☆91Jul 2, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Pilhyeon / BAM-DETR
View on GitHub
Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'
☆36Feb 26, 2025Updated last year
solicucu / D3G
View on GitHub
☆15Oct 30, 2023Updated 2 years ago
hlchen23 / VERIFIED
View on GitHub
Official repository of NeurIPS D&B Track 2024 paper "VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understan…
☆40Jan 20, 2025Updated last year
huangb23 / VTimeLLM
View on GitHub
[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".
☆295Jun 13, 2024Updated 2 years ago
huangmozhi9527 / GMMFormer
View on GitHub
[AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval
☆21May 10, 2024Updated 2 years ago
gyxxyg / VTG-LLM
View on GitHub
[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
☆130Dec 10, 2024Updated last year
IMCCretrieval / MomentDiff
View on GitHub
MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023
☆80Nov 2, 2023Updated 2 years ago
ForeverPs / IncrementalVHD_GPE
View on GitHub
official code for paper: Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark
☆42Jan 9, 2024Updated 2 years ago
j-min / HiREST
View on GitHub
Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)
☆110Jan 23, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
RenShuhuai-Andy / TimeChat
View on GitHub
[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
☆425May 8, 2025Updated last year
minjoong507 / BM-DETR
View on GitHub
[WACV 2025] Official Pytorch code for "Background-aware Moment Detection for Video Moment Retrieval"
☆16Feb 24, 2025Updated last year
knightyxp / DGL
View on GitHub
[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.
☆49Oct 14, 2024Updated last year
fletcherjiang / LLMEPET
View on GitHub
[MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
☆130Aug 23, 2024Updated last year
minghangz / cpl
View on GitHub
CPL: Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning
☆65Mar 22, 2026Updated 4 months ago
zjuchenlong / WSAG
View on GitHub
[EMNLP'22] Weakly-Supervised Temporal Article Grounding
☆14Nov 25, 2023Updated 2 years ago
TencentARC / MindOmni
View on GitHub
[NeurIPS2025] The official implementation of MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO
☆139Oct 15, 2025Updated 9 months ago