[WACV 2025] Official Pytorch code for "Background-aware Moment Detection for Video Moment Retrieval"
☆16Feb 24, 2025Updated last year
Alternatives and similar repositories for BM-DETR
Users that are interested in BM-DETR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"☆16Oct 13, 2025Updated 6 months ago
- This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"☆13Aug 22, 2025Updated 7 months ago
- SelecMix: Debiased Learning by Contradicting-pair Sampling (NeurIPS 2022)☆13Jun 5, 2024Updated last year
- 🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"☆13Feb 1, 2023Updated 3 years ago
- Official PyTorch Implementation for CVPR'23 Paper, "The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training"☆20Dec 11, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning (ICML 2024)☆20Jun 5, 2024Updated last year
- The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)☆32Mar 29, 2024Updated 2 years ago
- [WACV 2026] MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval☆14Sep 18, 2025Updated 7 months ago
- Repo for paper "MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding".☆40Jun 9, 2025Updated 10 months ago
- Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos☆28Jun 24, 2024Updated last year
- Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"☆118Jun 9, 2021Updated 4 years ago
- A companion for the Causal Artificial Intelligence book.☆15Sep 24, 2025Updated 6 months ago
- CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency☆18Aug 10, 2022Updated 3 years ago
- ✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"☆45Mar 19, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [ACMMM 2024] An Inverse Partial Optimal Transport Framework for Music-guided Movie Trailer Generation☆16Mar 15, 2025Updated last year
- [NeurIPS 2021] Moment-DETR code and QVHighlights dataset☆338Mar 9, 2026Updated last month
- Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr…☆153Aug 21, 2024Updated last year
- [ICCV 2023] UniVTG: Towards Unified Video-Language Temporal Grounding☆375May 8, 2024Updated last year
- ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model☆16Apr 7, 2026Updated last week
- Code for Conformal Counterfactual Inference under Hidden Confounding (KDD’24)☆11Aug 30, 2024Updated last year
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".☆52Mar 18, 2025Updated last year
- FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)☆36Apr 17, 2025Updated last year
- Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning☆44Jul 2, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- sigma-MoE layer☆21Jan 5, 2024Updated 2 years ago
- Official PyTorch code of GroundVQA (CVPR'24)☆64Sep 13, 2024Updated last year
- Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 …☆248Aug 12, 2025Updated 8 months ago
- [2023 ACL] CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding☆31Aug 5, 2023Updated 2 years ago
- ☆58Apr 24, 2024Updated last year
- [CVPR 2024] Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection☆115Jul 17, 2024Updated last year
- ✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).☆58Apr 2, 2025Updated last year
- This is the official implementation of NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal R…☆35Jan 25, 2023Updated 3 years ago
- Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku☆13Aug 14, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official pytorch repository for "TR-DETR: Task-Reciprocal Transformer for Joint Moment Retrieval and Highlight Detection" (AAAI 2024 Pape…☆57Feb 22, 2025Updated last year
- This is the official implementation of ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos☆44Nov 5, 2025Updated 5 months ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆41Apr 8, 2026Updated last week
- This is an official PyTorch Implementation of Neighbor Relations Matter in Video Scene Detection.☆28Mar 19, 2025Updated last year
- Official code for the NeurIPS 2025 Paper: C3Po: Cross-View Cross-Modality Correspondence by Pointmap Prediction☆26Jan 27, 2026Updated 2 months ago
- [ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval☆162May 28, 2024Updated last year
- [ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization☆39Jul 29, 2022Updated 3 years ago