Lilidamowang / T2VIndexer-generativeSearch
☆13Aug 28, 2024Updated last year
Alternatives and similar repositories for T2VIndexer-generativeSearch
Users who are interested in T2VIndexer-generativeSearch are comparing it to the repositories listed below.
- The top conferences on video retrieval libraries in recent years, synchronized with my blog.☆14Nov 27, 2021Updated 4 years ago
- Continuously updated cutting-edge papers: video moment localization, temporal language grounding, or video clip retrieval.☆14Nov 8, 2023Updated 2 years ago
- [AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP☆19Aug 5, 2025Updated 6 months ago
- ☆20Jul 28, 2025Updated 6 months ago
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models☆20Jul 17, 2024Updated last year
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆21Feb 19, 2025Updated 11 months ago
- We introduce the direct document relevance optimization (DDRO) for training a pairwise ranker model. DDRO encourages the model to focus o…☆35Jan 10, 2026Updated last month
- Official code for Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions (CVPR 2024)☆28Jun 21, 2024Updated last year
- "Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022☆69Jun 27, 2022Updated 3 years ago
- ☆18Jun 10, 2025Updated 8 months ago
- Official repository of NeurIPS D&B Track 2024 paper "VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understan…☆40Jan 20, 2025Updated last year
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆39Sep 9, 2025Updated 5 months ago
- Official repository for "Unveiling Opinion Evolution via Prompting and Diffusion for Short Video Fake News Detection", ACL Findings 2024.☆14Apr 25, 2025Updated 9 months ago
- 🌀 R2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)☆90Jul 2, 2024Updated last year
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Jun 17, 2024Updated last year
- Documentation at☆14Mar 27, 2025Updated 10 months ago
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated last year
- [CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval☆38Feb 28, 2023Updated 2 years ago
- ☆170Oct 20, 2023Updated 2 years ago
- 🕵️♂️🔊 Automatically update Audio Deepfake Detection (ADD) papers daily using GitHub Actions (updates every 12 hours)☆17Updated this week
- This project is a demonstration of a content-based recommendation system for Spotify that leverages user's preferences and audio features…☆16Apr 4, 2023Updated 2 years ago
- ☆10Mar 31, 2025Updated 10 months ago
- quagga☆10Apr 7, 2020Updated 5 years ago
- Code for MME-SID accepted to CIKM 2025 Full Research track.☆27Oct 29, 2025Updated 3 months ago
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- Official Code Repository for the paper "Generating Realistic Images from In-the-wild Sounds", ICCV 2023☆12Aug 24, 2025Updated 5 months ago
- [IJCAI-24] Explore Internal and External Similarity for Single Image Deraining with Graph Neural Networks☆10Sep 2, 2024Updated last year
- ☆14Sep 17, 2024Updated last year
- A MATLAB package for analyzing chaotic properties of time series data☆11Jun 29, 2018Updated 7 years ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 4 months ago
- The corresponding code from our paper "Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning…☆13Jan 14, 2026Updated last month
- Speech Security and Privacy Compendium - Mini☆10Jun 18, 2024Updated last year
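- Automatically books appointment slots at the Civil Affairs Bureau marriage registry office; limited to the Guangdong Provincial Civil Affairs Bureau☆10Jun 25, 2023Updated 2 years ago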
- awesome-audio-visual-robustness☆11Jan 27, 2024Updated 2 years ago
- Fine-tuning Llama2-7b and other LLMs for categorising emails for Deutsche Bahn (German National Railways)☆13Oct 9, 2023Updated 2 years ago
- Implementation of our paper 'On Metric Learning For Audio-Text Cross-Modal Retrieval'☆50May 17, 2022Updated 3 years ago
- Using machine learning techniques for prediction and modelling of nonlinear dynamic systems.☆10Jun 29, 2018Updated 7 years ago
- Calculate the Bhattacharyya distance, based on zero-crossing rate features, between different Gaussian models for speech emotion recognition. corpus…☆11Oct 17, 2018Updated 7 years ago
- Agentic Keyframe Search for Video Question Answering☆15Apr 7, 2025Updated 10 months ago