[ICLR 2025] TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval
☆26Feb 13, 2025Updated last year
Alternatives and similar repositories for TempMe
Users that are interested in TempMe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Nov 27, 2024Updated last year
- Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval -- AAAI2025☆17Jul 14, 2025Updated 8 months ago
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆55Mar 28, 2024Updated last year
- ☆14Aug 28, 2024Updated last year
- ☆14Dec 31, 2024Updated last year
- ☆13Jun 26, 2024Updated last year
- [EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning☆15May 13, 2025Updated 10 months ago
- ☆14May 20, 2025Updated 10 months ago
- Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)☆66Jun 7, 2024Updated last year
- [ICCV'25] The official code of paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"☆71Jan 13, 2026Updated 2 months ago
- Official PyTorch Implementation of ParGo: Bridging Vision-Language with Partial and Global Views. (AAAI 2025)☆16Jan 7, 2025Updated last year
- Official code for infimm-hd☆16Sep 4, 2024Updated last year
- [NeurIPS 2025] FastVID: Dynamic Density Pruning for Fast Video Large Language Models☆31Nov 10, 2025Updated 4 months ago
- ☆15Dec 12, 2023Updated 2 years ago
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆28Dec 28, 2023Updated 2 years ago
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆15Nov 20, 2025Updated 4 months ago
- [ICLR 2025] DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models☆19Mar 25, 2025Updated 11 months ago
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"☆25Feb 2, 2025Updated last year
- NLPIR tutorial: pretrain for IR. pre-train on raw textual corpus, fine-tune on MS MARCO Document Ranking☆13Sep 10, 2021Updated 4 years ago
- The code of paper "O-Mamba: O-shape State-Space Model for Underwater Image Enhancement"☆13Oct 18, 2024Updated last year
- ☆13Jul 11, 2018Updated 7 years ago
- Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025☆32Feb 22, 2026Updated last month
- Source code for TCSVT paper “Deep Semantic-Aware Proxy Hashing for Multi-Label Cross-Modal Retrieval”☆18Nov 30, 2025Updated 3 months ago
- ☆66Jan 23, 2026Updated 2 months ago
- ☆28May 16, 2023Updated 2 years ago
- Keras implementation of graph convolutional networks for sequence labelling☆12Sep 21, 2018Updated 7 years ago
- Richer Convolutional Features for Edge Detection model in pytorch CVPR2017☆10Dec 23, 2021Updated 4 years ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆21Jul 31, 2023Updated 2 years ago
- official code for "Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval"☆42Jul 4, 2025Updated 8 months ago
- semantic role labeling based on deep learning, implemented by tensorflow☆16Aug 20, 2018Updated 7 years ago
- DrQA with Tensorflow☆11Oct 28, 2017Updated 8 years ago
- Model Preparation Algorithm: a Transfer Learning Framework☆23Mar 8, 2023Updated 3 years ago
- ☆30Aug 14, 2023Updated 2 years ago
- Fine-grained Entity Typing / Fine-grained Entity Classification☆12Apr 19, 2018Updated 7 years ago
- ☆13Dec 23, 2021Updated 4 years ago
- (AAAI25) This is the official code repository for "MM-CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios".☆16May 30, 2025Updated 9 months ago
- Edge Tangent Flow☆16Dec 6, 2018Updated 7 years ago
- DeepTumorVQA benchmark (9262 CT images + 395k QA pairs)☆31Jul 8, 2025Updated 8 months ago
- [CVPR 2024] TeachCLIP for Text-to-Video Retrieval☆42May 7, 2025Updated 10 months ago