Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval -- AAAI2025
☆17Jul 14, 2025Updated 7 months ago
Alternatives and similar repositories for Text-Proxy
Users that are interested in Text-Proxy are comparing it to the libraries listed below
Sorting:
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"☆25Feb 2, 2025Updated last year
- [NeurIPS 2023] The official implementation of paper "Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval" acce…☆27May 14, 2024Updated last year
- [ICCV'23] UATVR: Uncertainty-Adaptive Text-Video Retrieval☆13Nov 5, 2023Updated 2 years ago
- [CVPR 2024] TeachCLIP for Text-to-Video Retrieval☆42May 7, 2025Updated 9 months ago
- ☆20Jul 28, 2025Updated 7 months ago
- ICCV'23 Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval☆19Aug 22, 2025Updated 6 months ago
- https://layer6ai-labs.github.io/xpool/☆134Jul 1, 2023Updated 2 years ago
- This is a sample of recommender system based on keywords from local top-ranking news and provides candidate visiting routes. The default …☆18Aug 14, 2021Updated 4 years ago
- Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)☆66Jun 7, 2024Updated last year
- [NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations☆143Apr 9, 2024Updated last year
- ☆36Mar 28, 2024Updated last year
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval☆41Apr 11, 2025Updated 10 months ago
- Official implementation of "Meta-Entity Driven Triplet Mining for Aligning Medical Vision-Language Models"☆14Mar 19, 2025Updated 11 months ago
- The official implementation of "Enhancing Representation in Radiography-Reports Foundation Model: A Granular Alignment Algorithm Using Ma…☆13Sep 13, 2024Updated last year
- 由Hugo+gitpages搭建的个人博客 | 访问地址为 https://ephmeral.github.io☆10Dec 30, 2022Updated 3 years ago
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆55Mar 28, 2024Updated last year
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆20Aug 1, 2025Updated 7 months ago
- ☆10Mar 31, 2025Updated 11 months ago
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆21Jun 23, 2025Updated 8 months ago
- The code for the paper "Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning" (CVPR'25).☆14Sep 25, 2025Updated 5 months ago
- Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval --ICCV2023 Oral☆91Nov 2, 2023Updated 2 years ago
- Repository of "Improving Cross-Modal Retrieval With Set of Diverse Embeddings" (CVPR'23, Highlight)☆41Nov 15, 2023Updated 2 years ago
- ☆12Dec 15, 2023Updated 2 years ago
- Customized implementation of the U-Net in PyTorch for super-resolving hyper-low-field MRI images.☆19Apr 5, 2023Updated 2 years ago
- ☆10Nov 27, 2024Updated last year
- [AAAI 2023] Official repository of "Progressive Few-Shot Adaptation of Generative Model with Align-Free Spatial Correlation"☆10Jul 4, 2023Updated 2 years ago
- This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning.☆17Sep 2, 2025Updated 6 months ago
- [ICCV 2023] Official implementation of paper "SOAR: Scene-debiasing Open-set Action Recognition".☆12Dec 23, 2023Updated 2 years ago
- Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025☆30Feb 22, 2026Updated last week
- ☆12May 26, 2023Updated 2 years ago
- [NeurIPS 2024] Activating Self-Attention for Multi-Scene Absolute Pose Regression☆14Feb 24, 2025Updated last year
- ☆25Jul 23, 2025Updated 7 months ago
- Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024☆58Aug 19, 2025Updated 6 months ago
- The official code of paper "Multi-to-Single: Reducing Multimodal Dependency in Emotion Recognition through Contrastive Learning" (AAAI 20…☆30Sep 30, 2025Updated 5 months ago
- Source code of our MM'22 paper Partially Relevant Video Retrieval☆55Nov 4, 2024Updated last year
- ☆13Aug 14, 2022Updated 3 years ago
- Official Repository for "Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection" (AAAI …☆14Mar 1, 2025Updated last year
- The official code of "CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval"☆14Sep 19, 2024Updated last year