huangmozhi9527 / GMMFormer
[AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval
☆20 · May 10, 2024 · Updated last year
Alternatives and similar repositories for GMMFormer
Users who are interested in GMMFormer are comparing it to the repositories listed below.
- ICCV'23 Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval ☆19 · Aug 22, 2025 · Updated 5 months ago
- [ICCV'23] UATVR: Uncertainty-Adaptive Text-Video Retrieval ☆13 · Nov 5, 2023 · Updated 2 years ago
- Source code of our MM'22 paper Partially Relevant Video Retrieval ☆55 · Nov 4, 2024 · Updated last year
- [2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval ☆42 · Sep 23, 2021 · Updated 4 years ago
- ☆30 · Aug 14, 2023 · Updated 2 years ago
- The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024) ☆32 · Mar 29, 2024 · Updated last year
- Official github repo for ICCV2023 paper 'Multi-event Video-Text Retrieval' ☆19 · Feb 16, 2024 · Updated 2 years ago
- Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023) ☆55 · Sep 7, 2023 · Updated 2 years ago
- The code for the paper "Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval" (WWW'22, Oral). ☆17 · Mar 8, 2022 · Updated 3 years ago
- [ICCV 2025] Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning. ☆49 · Dec 10, 2025 · Updated 2 months ago
- Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021) ☆57 · Aug 31, 2021 · Updated 4 years ago
- [CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval ☆64 · Jun 19, 2024 · Updated last year
- Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023) ☆66 · Jun 7, 2024 · Updated last year
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023 ☆11 · Oct 5, 2023 · Updated 2 years ago
- ☆14 · Jan 5, 2022 · Updated 4 years ago
- The code for the paper "Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning" (CVPR'25). ☆14 · Sep 25, 2025 · Updated 4 months ago
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)" ☆25 · Feb 2, 2025 · Updated last year
- Official pytorch repository for "TR-DETR: Task-Reciprocal Transformer for Joint Moment Retrieval and Highlight Detection" (AAAI 2024 Pape… ☆54 · Feb 22, 2025 · Updated 11 months ago
- This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023) ☆31 · Jun 28, 2024 · Updated last year
- Weakly Supervised Video Moment Localisation with Contrastive Negative Sample Mining ☆30 · Apr 4, 2022 · Updated 3 years ago
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition ☆30 · Apr 19, 2023 · Updated 2 years ago
- [ICCV 2023 CLVL Workshop] Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts ☆14 · Jan 13, 2025 · Updated last year
- Official repository of NeurIPS D&B Track 2024 paper "VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understan… ☆40 · Jan 20, 2025 · Updated last year
- Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization ☆16 · Jul 20, 2023 · Updated 2 years ago
- [NeurIPS 2025] FastVID: Dynamic Density Pruning for Fast Video Large Language Models ☆26 · Nov 10, 2025 · Updated 3 months ago
- ☆15 · Aug 4, 2025 · Updated 6 months ago
- [WACV 2025] Official Pytorch code for "Background-aware Moment Detection for Video Moment Retrieval" ☆16 · Feb 24, 2025 · Updated 11 months ago
- The code for the paper "Efficient Self-Supervised Video Hashing with Selective State Spaces" (AAAI'25). ☆22 · Aug 2, 2025 · Updated 6 months ago
- 【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition ☆38 · Apr 27, 2024 · Updated last year
- [CVPR 2024] Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection ☆114 · Jul 17, 2024 · Updated last year
- [ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval ☆161 · May 28, 2024 · Updated last year
- [CVPR 2024] Action-slot: Visual Action-centric Representations for Atomic Activity Recognition in Traffic Scenes ☆24 · Apr 28, 2025 · Updated 9 months ago
- ☆20 · Jul 28, 2025 · Updated 6 months ago
- source code of our MGPN in SIGIR 2022 ☆18 · Jun 8, 2022 · Updated 3 years ago
- [EMNLP 2025 Industry] Datasets and Recipes for Video Temporal Grounding via Reinforcement Learning ☆35 · Oct 22, 2025 · Updated 3 months ago
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval. ☆47 · Oct 14, 2024 · Updated last year
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning ☆21 · Feb 19, 2025 · Updated 11 months ago
- [AAAI 2023] Contrastive Masked Autoencoders for Self-Supervised Video Hashing ☆27 · Jul 4, 2023 · Updated 2 years ago
- [ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model ☆140 · Apr 9, 2024 · Updated last year