[ICLR 2025] TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval
☆26Feb 13, 2025Updated last year
Alternatives and similar repositories for TempMe
Users that are interested in TempMe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [WACV 2026] MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval☆14Sep 18, 2025Updated 8 months ago
- ☆10Nov 27, 2024Updated last year
- Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval -- AAAI2025☆20May 8, 2026Updated last month
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆22Jun 23, 2025Updated 11 months ago
- ☆15Aug 28, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14Dec 31, 2024Updated last year
- ☆12Jun 26, 2024Updated last year
- [EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning☆14May 13, 2025Updated last year
- ☆14May 20, 2025Updated last year
- [CVPR 2025] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆17Oct 4, 2025Updated 8 months ago
- Official PyTorch Implementation of ParGo: Bridging Vision-Language with Partial and Global Views. (AAAI 2025)☆16Jan 7, 2025Updated last year
- Official code for infimm-hd☆16Sep 4, 2024Updated last year
- ☆15Dec 12, 2023Updated 2 years ago
- Video-Language Alignment via Spatio–Temporal Graph Transformer; ArXiv: https://arxiv.org/abs/2407.11677☆15Jul 24, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆31Dec 28, 2023Updated 2 years ago
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆16Nov 20, 2025Updated 6 months ago
- [ICLR 2025] DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models☆19Mar 25, 2025Updated last year
- ☆17Mar 26, 2026Updated 2 months ago
- [AAAI 2022] SECRET: Self-Consistent Pseudo Label Refinement for Unsupervised Domain Adaptive Person Re-Identification☆18Sep 11, 2022Updated 3 years ago
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"☆26Feb 2, 2025Updated last year
- A Massive Multi-Discipline Lecture Understanding Benchmark☆34Apr 20, 2026Updated last month
- The code of paper "O-Mamba: O-shape State-Space Model for Underwater Image Enhancement"☆14Oct 18, 2024Updated last year
- AdaTriplet loss & automargin method☆20Mar 8, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Sign language HamNoSys notation parsing tool.☆19Jan 14, 2023Updated 3 years ago
- Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025☆35Feb 22, 2026Updated 3 months ago
- ☆14Jun 17, 2024Updated last year
- Source code for TCSVT paper “Deep Semantic-Aware Proxy Hashing for Multi-Label Cross-Modal Retrieval”☆21Nov 30, 2025Updated 6 months ago
- [ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks☆31Oct 28, 2025Updated 7 months ago
- ☆66Jan 23, 2026Updated 4 months ago
- ☆28May 16, 2023Updated 3 years ago
- HamNoSys2SiGML is an automation system designed to receive a set of HamNoSys codes with the optional addition of its respective glosses a…☆13Mar 28, 2021Updated 5 years ago
- Richer Convolutional Features for Edge Detection model in pytorch CVPR2017☆10Dec 23, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- official code for "Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval"☆42Jul 4, 2025Updated 11 months ago
- [NeurIPS 25] Hyperbolic Contrastive Regularisation for Geometrically Aware Sign Language Translation☆27Nov 26, 2025Updated 6 months ago
- semantic role labeling based on deep learning, implemented by tensorflow☆16Aug 20, 2018Updated 7 years ago
- Model Preparation Algorithm: a Transfer Learning Framework☆24Mar 8, 2023Updated 3 years ago
- Use naive MultiheadAttention implement to replace nn.MultiheadAttention in pytorch☆39Feb 20, 2025Updated last year
- Sequence Tagging with Cross-Lingual Transfer Learning☆16Jul 30, 2017Updated 8 years ago
- ☆14May 7, 2019Updated 7 years ago