sejong-rcv / VVS
[AAAI-24] VVS : Video-to-Video Retrieval With Irrelevant Frame Suppression
☆20Updated 8 months ago
Alternatives and similar repositories for VVS:
Users that are interested in VVS are comparing it to the libraries listed below
- ☆21Updated last year
- This is an official implementation of GRIT-VLP☆21Updated 2 years ago
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'☆27Updated last year
- ☆33Updated 2 years ago
- [ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"☆45Updated 4 months ago
- [BMVC'21] Official PyTorch Implementation of "Grounded Situation Recognition with Transformers"☆26Updated 2 years ago
- [ECCV-2022]Grounding Visual Representations with Texts for Domain Generalization☆31Updated last year
- Video-Text Representation Learning via Differentiable Weak Temporal Alignment (CVPR 2022)☆16Updated 9 months ago
- ☆36Updated last week
- [WACV 2025] Official Pytorch code for "Background-aware Moment Detection for Video Moment Retrieval"☆14Updated 3 months ago
- [CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval☆48Updated 7 months ago
- [CVPR 2023] ViPLO - Official Pytorch Implementation☆39Updated last year
- Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)☆124Updated 6 months ago
- Repository of "Improving Cross-Modal Retrieval With Set of Diverse Embeddings" (CVPR'23, Highlight)☆36Updated last year
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11Updated last year
- CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency☆17Updated 2 years ago
- ☆46Updated 9 months ago
- ☆17Updated 9 months ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆15Updated 3 months ago
- ☆30Updated 2 months ago
- SimOn: A Simple Framework for Online Temporal Action Localization☆18Updated 2 years ago
- ☆27Updated 2 years ago
- ISR-DPO:Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO☆13Updated 2 weeks ago
- ☆25Updated last year
- Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)☆57Updated 8 months ago
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval"☆16Updated 4 months ago
- ☆16Updated 5 months ago
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…☆18Updated 9 months ago
- The official code for Relational Context Learning for Human-Object Interaction Detection, CVPR2023.☆48Updated last year
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"☆13Updated last month