[AAAI-24] VVS : Video-to-Video Retrieval With Irrelevant Frame Suppression
☆20May 14, 2024Updated last year
Alternatives and similar repositories for VVS
Users that are interested in VVS are comparing it to the libraries listed below.
- [2024] INSANet: INtra-INter Spectral Attention Network for Effective Feature Fusion of Multispectral Pedestrian Detection, Sensors.☆23Mar 20, 2024Updated 2 years ago
- [ACM MM-24] Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization☆12Oct 8, 2024Updated last year
- 2025 summer undergraduate research student pre-application form and notes (2025.07.01-2025.08.31)☆10Mar 10, 2026Updated 2 weeks ago
- Authors' official PyTorch implementation of the "DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval" [IJCV 20…☆70Apr 12, 2023Updated 2 years ago
- ☆17Nov 29, 2024Updated last year
- [ICLR 2023] Temporal Alignment Representations with Contrastive Learning☆27Apr 22, 2023Updated 2 years ago
- Authors' official PyTorch implementation of the "Self-Supervised Video Similarity Learning" [CVPRW 2023]☆43Nov 25, 2023Updated 2 years ago
- ☆57Aug 16, 2025Updated 7 months ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆21Oct 8, 2024Updated last year
- This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https…☆22Jul 5, 2024Updated last year
- Official Implementation of ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)☆23Nov 25, 2025Updated 4 months ago
- FIVR-200K dataset from the "FIVR: Fine-grained Incident Video Retrieval" [TMM 2019]☆81Apr 13, 2023Updated 2 years ago
- Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"☆27Feb 27, 2026Updated 3 weeks ago
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆43Mar 11, 2025Updated last year
- ☆49Nov 1, 2024Updated last year
- Code for the Video Similarity Challenge.☆82Feb 5, 2024Updated 2 years ago
- Official PyTorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)☆55Sep 7, 2023Updated 2 years ago
- [EMNLP'22] Weakly-Supervised Temporal Article Grounding☆14Nov 25, 2023Updated 2 years ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- ☆13Nov 28, 2021Updated 4 years ago
- Human-centric environment representations from egocentric video☆14Feb 5, 2026Updated last month
- (WACV 2021) Temporal Context Aggregation for Video Retrieval with Contrastive Learning☆29Aug 4, 2021Updated 4 years ago
- A very hacky set of functions for getting plotly to do what I want when doing mech interp research, designed to be compatible with PyTorc…☆13Jun 16, 2023Updated 2 years ago
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 11 months ago
- Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models"☆19Feb 1, 2026Updated last month
- [CVPR 2024] Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection☆114Jul 17, 2024Updated last year
- Official PyTorch implementation of Adaptive Self-Training Framework for Fine-grained Scene Graph generation (ST-SGG), accept…☆22Jan 30, 2024Updated 2 years ago
- ☆27Oct 19, 2022Updated 3 years ago
- ☆33Jul 28, 2022Updated 3 years ago
- Artificial Intelligence course, Department of Intelligent Mechatronics Engineering, 1st semester of the 2019 academic year☆32Jun 24, 2019Updated 6 years ago
- [ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation".☆13Feb 24, 2025Updated last year
- HD-EPIC Python script to download the entire dataset or parts of it☆18Oct 7, 2025Updated 5 months ago
- Official Implementation of DMT: Dual Mean-Teacher in PyTorch.☆10Oct 27, 2023Updated 2 years ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆21Jul 16, 2025Updated 8 months ago
- Embedding language models in probability space via log-likelihood vectors☆16Oct 25, 2025Updated 5 months ago
- ☆14Apr 16, 2018Updated 7 years ago
- Lightweight Adapting for Black-Box Large Language Models☆25Feb 15, 2024Updated 2 years ago
- Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization☆16Jul 20, 2023Updated 2 years ago
- Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR2022]☆135Feb 4, 2024Updated 2 years ago