☆12May 26, 2023Updated 2 years ago
Alternatives and similar repositories for UEDVC
Users that are interested in UEDVC are comparing it to the libraries listed below
Sorting:
- Video-Text Representation Learning via Differentiable Weak Temporal Alignment (PyTorch implementation for the CVPR 2022 paper)☆11Oct 12, 2022Updated 3 years ago
- ICCV'23 Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval☆19Aug 22, 2025Updated 6 months ago
- [ICCV 2023] The official PyTorch implementation of the paper: "Localizing Moments in Long Video Via Multimodal Guidance"☆19Sep 26, 2024Updated last year
- [ECCV 2024] Official code for "Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation"☆18Jul 31, 2025Updated 7 months ago
- This is the official implementation of RGNet: A Unified Retrieval and Grounding Network for Long Videos☆19Mar 3, 2025Updated last year
- [ICCV 2025] Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.☆49Dec 10, 2025Updated 2 months ago
- End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)☆229Jan 3, 2024Updated 2 years ago
- [ICCV'25] The official code of paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"☆70Jan 13, 2026Updated last month
- Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos☆28Dec 8, 2023Updated 2 years ago
- Weakly Supervised Video Moment Localisation with Contrastive Negative Sample Mining☆30Apr 4, 2022Updated 3 years ago
- ☆31Mar 24, 2022Updated 3 years ago
- [AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning☆68Feb 16, 2024Updated 2 years ago
- [2023 ACL] CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding☆31Aug 5, 2023Updated 2 years ago
- Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)☆34Sep 17, 2022Updated 3 years ago
- ☆11Dec 6, 2024Updated last year
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆18Jul 10, 2025Updated 7 months ago
- [ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval☆161May 28, 2024Updated last year
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆21Jun 23, 2025Updated 8 months ago
- This is a Pytorch Implementation of the DASP algorithm from the paper "Explaining Deep Neural Networks with a Polynomial Time Algorithm f…☆11Jun 12, 2020Updated 5 years ago
- [CVPR 2024] PLGSLAM☆12Nov 24, 2025Updated 3 months ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆55Mar 28, 2024Updated last year
- C++ implementation of the Focused Ant Colony Optimization (ACO) for solving the TSP☆11Feb 26, 2022Updated 4 years ago
- Code used in the project "Leveraging data assimilation and monitoring data for improvement of crop growth estimates in protected environm…☆13Aug 11, 2024Updated last year
- ☆15Dec 2, 2025Updated 3 months ago
- ☆10Oct 7, 2023Updated 2 years ago
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- This is tensorflow 2.2 based SCAMET framework for remote sensing image captioning.☆13Aug 10, 2023Updated 2 years ago
- Weakly Supervised Referring Video Object Segmentation with Object-Centric Pseudo-Guidance☆10Aug 17, 2024Updated last year
- The code for the paper "Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning" (CVPR'25).☆14Sep 25, 2025Updated 5 months ago
- Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆15Nov 18, 2025Updated 3 months ago
- Unofficial PyTorch implementation of MapNet: An Allocentric Spatial Memory for Mapping Environments☆12Jun 4, 2020Updated 5 years ago
- [2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval☆42Sep 23, 2021Updated 4 years ago
- ☆48Sep 22, 2023Updated 2 years ago
- LLaVA-Next for STVG☆18Dec 5, 2025Updated 2 months ago
- Implementation of MetaQNN (https://arxiv.org/abs/1611.02167, https://github.com/bowenbaker/metaqnn.git) with Additions and Modifications …☆11Aug 8, 2018Updated 7 years ago
- Code for "Neural Retrievers are Biased Towards LLM-Generated Content"☆14Oct 18, 2024Updated last year
- This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"☆13Aug 22, 2025Updated 6 months ago
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision☆12Sep 17, 2023Updated 2 years ago