☆26Jan 12, 2022Updated 4 years ago
Alternatives and similar repositories for MQVR
Users that are interested in MQVR are comparing it to the libraries listed below
- [CVPR 2022] The code for our paper "Object-aware Video-language Pre-training for Retrieval"☆62May 25, 2022Updated 3 years ago
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training☆22Mar 19, 2022Updated 3 years ago
- 🏆 The 1st Place Solution for AICity2022 Challenge Track2: Natural Language-Based Vehicle Retrieval.☆12Jul 25, 2022Updated 3 years ago
- ☆101Sep 27, 2021Updated 4 years ago
- ☆31Mar 24, 2022Updated 3 years ago
- ☆15Sep 16, 2021Updated 4 years ago
- Code and benchmarks for the Semantic Video Retrieval Task☆53Oct 18, 2022Updated 3 years ago
- Data Release for VALUE Benchmark☆30Feb 16, 2022Updated 4 years ago
- [ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing☆16Aug 26, 2022Updated 3 years ago
- Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).☆141Jul 20, 2022Updated 3 years ago
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)☆20Dec 4, 2021Updated 4 years ago
- [ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization☆39Jul 29, 2022Updated 3 years ago
- ☆259Dec 10, 2022Updated 3 years ago
- CLIP-It! Language-Guided Video Summarization☆75Jun 21, 2021Updated 4 years ago
- [CVPR2022 Oral] The official code for "TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognit…☆18Aug 1, 2022Updated 3 years ago
- Condensed Movies Challenge 2021☆20Sep 21, 2022Updated 3 years ago
- Starter Code for VALUE benchmark☆80Aug 23, 2022Updated 3 years ago
- A PyTorch implementation of VIOLET☆140Dec 17, 2023Updated 2 years ago
- [AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding☆91Nov 16, 2022Updated 3 years ago
- [CVPR 2023] Egocentric Audio-Visual Object Localization☆26Jan 6, 2024Updated 2 years ago
- Multi-Scale Attention for Audio Question Answering☆28Jul 19, 2023Updated 2 years ago
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"☆107Jan 28, 2024Updated 2 years ago
- TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision [AAAI2023 Oral]☆58Feb 25, 2023Updated 3 years ago
- [NeurIPS 2025] Reward-Instruct: A Reward-Centric Approach to Fast Photo-Realistic Image Generation☆34Oct 24, 2025Updated 4 months ago
- ☆26Aug 1, 2023Updated 2 years ago
- PyTorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners☆116Sep 15, 2022Updated 3 years ago
- [SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval.☆134May 4, 2022Updated 3 years ago
- ☆181Aug 20, 2022Updated 3 years ago
- [AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning☆68Feb 16, 2024Updated 2 years ago
- A Raspberry Pi 5-based smart facial recognition door access system integrating OpenCV face detection, servo control, and voice feedback m…☆10Oct 28, 2024Updated last year
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- Source code of our RaNet in EMNLP 2021☆30May 31, 2022Updated 3 years ago
- Authors' official PyTorch implementation of the "DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval" [IJCV 20…☆70Apr 12, 2023Updated 2 years ago
- 8th place solution (second round) of Video Copyright Detection Algorithm Track, 2019 CCF Big Data & Computing Int…☆30Nov 19, 2019Updated 6 years ago
- LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images☆32Nov 30, 2023Updated 2 years ago
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)☆36Apr 9, 2022Updated 3 years ago
- [CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning☆73Nov 7, 2022Updated 3 years ago
- [ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by …☆77Jan 27, 2024Updated 2 years ago
- Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR2022]☆135Feb 4, 2024Updated 2 years ago