sejong-rcv / VVS
[AAAI-24] VVS: Video-to-Video Retrieval With Irrelevant Frame Suppression
☆20Updated last year
Alternatives and similar repositories for VVS
Users that are interested in VVS are comparing it to the libraries listed below
- ☆23Updated last year
- Official Implementation of ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)☆18Updated 3 months ago
- CVPR 2022: Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency☆17Updated 2 years ago
- [ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"☆60Updated 2 months ago
- Question-Aware Gaussian Experts for Audio-Visual Question Answering -- Official Pytorch Implementation (CVPR'25, Highlight)☆14Updated last week
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'☆31Updated 2 months ago
- [ICLR 2023] Temporal Alignment Representations with Contrastive Learning☆26Updated 2 years ago
- Pytorch implementation of Twelve Labs' Video Foundation Model evaluation framework & open embeddings☆25Updated 8 months ago
- [BMVC'21] Official PyTorch Implementation of "Grounded Situation Recognition with Transformers"☆26Updated 3 years ago
- MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)☆33Updated last year
- [CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval☆55Updated 10 months ago
- Codebase for the paper: "TIM: A Time Interval Machine for Audio-Visual Action Recognition"☆41Updated 6 months ago
- [ECCV-2022] Grounding Visual Representations with Texts for Domain Generalization☆31Updated 2 years ago
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.☆20Updated 3 months ago
- Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)☆50Updated last year
- Repository of "Improving Cross-Modal Retrieval With Set of Diverse Embeddings" (CVPR'23, Highlight)☆40Updated last year
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11Updated last year
- ☆10Updated last month
- [WACV 2025] Official Pytorch code for "Background-aware Moment Detection for Video Moment Retrieval"☆13Updated 2 months ago
- ☆34Updated 8 months ago
- Official Repository for "Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection" (AAAI …☆11Updated 2 months ago
- ICCV2023 paper on Tubelet-Contrastive Self-Supervision for Video-Efficient Generalization☆11Updated last year
- This is an official implementation of GRIT-VLP☆21Updated 2 years ago
- Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)☆58Updated 11 months ago
- ☆33Updated 2 years ago
- ☆36Updated last year
- Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)☆100Updated 3 months ago
- The official code and model for ACL 2023 paper 'mCLIP: Multilingual CLIP via Cross-lingual Transfer'☆10Updated last year
- ☆46Updated last year
- The official codebase of FineAction dataset. We will update the data and code of our FineAction.☆18Updated last month