sejong-rcv / VVS
[AAAI-24] VVS : Video-to-Video Retrieval With Irrelevant Frame Suppression
☆20Updated 10 months ago
Alternatives and similar repositories for VVS:
Users that are interested in VVS are comparing it to the libraries listed below
- ☆23Updated last year
- This is an official implementation of GRIT-VLP☆21Updated 2 years ago
- [BMVC'21] Official PyTorch Implementation of "Grounded Situation Recognition with Transformers"☆26Updated 2 years ago
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'☆29Updated 3 weeks ago
- Temporal Alignment Representations with Contrastive Learning☆26Updated last year
- Official Implementation of ISR-DPO:Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)☆17Updated last month
- The official codebase of FineAction dataset. We will update the data and code of our FineAction.☆17Updated 2 years ago
- ☆40Updated last week
- [ECCV-2022]Grounding Visual Representations with Texts for Domain Generalization☆31Updated last year
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.☆17Updated last month
- Repository of "Improving Cross-Modal Retrieval With Set of Diverse Embeddings" (CVPR'23, Highlight)☆38Updated last year
- MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)☆33Updated 10 months ago
- [ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"☆53Updated 3 weeks ago
- CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency☆17Updated 2 years ago
- [CVPR 2022] The code for our paper 《Object-aware Video-language Pre-training for Retrieval》☆62Updated 2 years ago
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…☆18Updated 10 months ago
- ☆26Updated last year
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11Updated last year
- ☆34Updated 4 months ago
- [CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval☆53Updated 9 months ago
- ☆46Updated 10 months ago
- [ECCV 2024] Official code for "Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation"☆18Updated 5 months ago
- Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)☆50Updated last year
- Video-Text Representation Learning via Differentiable Weak Temporal Alignment (CVPR 2022)☆16Updated 11 months ago
- Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)☆57Updated 9 months ago
- ☆16Updated last week
- [ICLR 2023] RC-MAE☆52Updated last year
- [WACV 2025] Official Pytorch code for "Background-aware Moment Detection for Video Moment Retrieval"☆14Updated 3 weeks ago
- A Unified Framework for Video-Language Understanding☆57Updated last year