FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)
β35Apr 17, 2025Updated 10 months ago
Alternatives and similar repositories for FlashVTG
Users that are interested in FlashVTG are comparing it to the libraries listed below
Sorting:
- This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"β13Aug 22, 2025Updated 6 months ago
- π R2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)β91Jul 2, 2024Updated last year
- Official pytorch repository for "TR-DETR: Task-Reciprocal Transformer for Joint Moment Retrieval and Highlight Detection" (AAAI 2024 Papeβ¦β55Feb 22, 2025Updated last year
- Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"β92Mar 9, 2025Updated 11 months ago
- β16Dec 4, 2024Updated last year
- [CVPR 2024] Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detectionβ114Jul 17, 2024Updated last year
- Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learningβ15Dec 12, 2023Updated 2 years ago
- Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grβ¦β151Aug 21, 2024Updated last year
- [MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrievalβ130Aug 23, 2024Updated last year
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of uβ¦β25Jun 4, 2025Updated 9 months ago
- [CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detectionβ29Sep 26, 2024Updated last year
- [2023 ACL] CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Groundingβ31Aug 5, 2023Updated 2 years ago
- Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 β¦β246Aug 12, 2025Updated 6 months ago
- This is the official implementation of RGNet: A Unified Retrieval and Grounding Network for Long Videosβ19Mar 3, 2025Updated last year
- DisTime: Distribution-based Time Representation for Video Large Language Models.β19Jul 10, 2025Updated 7 months ago
- [ICLR 2025] Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Groundingβ40Mar 18, 2025Updated 11 months ago
- [NeurIPS'25] Time-R1: Post-Training Large Vision Language Model for Temporal Video Groundingβ82Dec 14, 2025Updated 2 months ago
- [NeurIPS 2021] Moment-DETR code and QVHighlights datasetβ344Apr 18, 2024Updated last year
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervisionβ12Sep 17, 2023Updated 2 years ago
- LLaVA-Next for STVGβ18Dec 5, 2025Updated 3 months ago
- [ICCV'25] Official PyTorch Implementation of "JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers"β29Nov 27, 2025Updated 3 months ago
- MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023β80Nov 2, 2023Updated 2 years ago
- [ICCV 2023] UniVTG: Towards Unified Video-Language Temporal Groundingβ376May 8, 2024Updated last year
- Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videosβ28Jun 24, 2024Updated last year
- π¦ A lightweight machine learning toolkit for researchers, providing common model design & learning functionalities.β28Jul 2, 2025Updated 8 months ago
- β15May 25, 2024Updated last year
- SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capabilityβ16May 8, 2025Updated 10 months ago
- [ICCV 2025] Factorized Learning for Temporally Grounded Video-Language Modelsβ24Jan 1, 2026Updated 2 months ago
- Repo for paper "MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding".β39Jun 9, 2025Updated 9 months ago
- β14Oct 30, 2023Updated 2 years ago
- [WACV 2026] MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrievalβ13Sep 18, 2025Updated 5 months ago
- The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)β32Mar 29, 2024Updated last year
- Official repository of NeurIPS D&B Track 2024 paper "VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanβ¦β40Jan 20, 2025Updated last year
- [AAAI 2026] SIFThinker: Spatially-Aware Image Focus for Visual Reasoningβ23Dec 2, 2025Updated 3 months ago
- Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localizationβ16Jul 20, 2023Updated 2 years ago
- SAM 2++: Tracking Anything at Any Granularityβ56Dec 15, 2025Updated 2 months ago
- [CVPR 2024] Context-Guided Spatio-Temporal Video Groundingβ66Jun 28, 2024Updated last year
- [WACV 2025] Official Pytorch code for "Background-aware Moment Detection for Video Moment Retrieval"β16Feb 24, 2025Updated last year
- Transactions on Multimedia (TMM25)β19Apr 8, 2025Updated 11 months ago