☆14Aug 28, 2024Updated last year
Alternatives and similar repositories for T2VIndexer-generativeSearch
Users that are interested in T2VIndexer-generativeSearch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆21Feb 19, 2025Updated last year
- ☆13Jan 5, 2025Updated last year
- The top conferences on video retrieval libraries in recent years, synchronized with my blog.☆14Nov 27, 2021Updated 4 years ago
- 前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。☆14Nov 8, 2023Updated 2 years ago
- [CVPR 2025] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆17Oct 4, 2025Updated 5 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Implementation of our paper 'On Metric Learning For Audio-Text Cross-Modal Retrieval'☆51May 17, 2022Updated 3 years ago
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models☆20Jul 17, 2024Updated last year
- Official repository of NeurIPS D&B Track 2024 paper "VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understan…☆40Jan 20, 2025Updated last year
- ☆20Jul 28, 2025Updated 8 months ago
- [WACV 2026] MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval☆13Sep 18, 2025Updated 6 months ago
- ☆14Jun 19, 2024Updated last year
- Tensorflow/Keras implementation of a Conditional Generative Adversarial Network (CGAN) model that can be used for image denoising or arte…☆13May 13, 2019Updated 6 years ago
- [AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP☆19Aug 5, 2025Updated 7 months ago
- [2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval☆42Sep 23, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆19Jun 10, 2025Updated 9 months ago
- [ICLR 2025] DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models☆19Mar 25, 2025Updated last year
- ☆17Aug 13, 2024Updated last year
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 8 months ago
- Multi-Scale Attention for Audio Question Answering☆28Jul 19, 2023Updated 2 years ago
- [CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval☆38Feb 28, 2023Updated 3 years ago
- 【ICME2025 Oral】Offical Pytorch Code for "Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition"☆11Mar 21, 2025Updated last year
- 🌀 R2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)☆90Jul 2, 2024Updated last year
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆31May 31, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]☆43Oct 7, 2024Updated last year
- "Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022☆68Jun 27, 2022Updated 3 years ago
- We introduce the direct document relevance optimization (DDRO) for training a pairwise ranker model. DDRO encourages the model to focus o…☆36Jan 10, 2026Updated 2 months ago
- [CVPRW2024, Official Code] for paper "Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribu…☆14Jun 14, 2024Updated last year
- ☆14Jul 5, 2024Updated last year
- [IJCAI-24] Explore Internal and External Similarity for Single Image Deraining with Graph Neural Networks☆10Sep 2, 2024Updated last year
- This project simply uses torchvision pretrained model to finetune and classify whether an image is anime or reality☆15Aug 11, 2025Updated 7 months ago
- 【ICME2025 Oral】 Offical Pytorch Code for "Learning Dual-Domain Multi-Scale Representations for Single Image Deraining"☆18Mar 21, 2025Updated last year
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆170Oct 20, 2023Updated 2 years ago
- Documentation at☆14Mar 27, 2025Updated last year
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆14Jan 9, 2024Updated 2 years ago
- ☆20Jan 31, 2022Updated 4 years ago
- ☆102Sep 27, 2021Updated 4 years ago
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 11 months ago
- [ICLR 2025] TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval☆26Feb 13, 2025Updated last year