sejong-rcv/VVS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sejong-rcv/VVS)

sejong-rcv / VVS

[AAAI-24] VVS : Video-to-Video Retrieval With Irrelevant Frame Suppression

☆21

Alternatives and similar repositories for VVS

Users that are interested in VVS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sejong-rcv / INSANet
View on GitHub
[2024] INSANet: INtra-INter Spectral Attention Network for Effective Feature Fusion of Multispectral Pedestrian Detection, Sensors.
☆23Mar 20, 2024Updated 2 years ago
sejong-rcv / PVLR
View on GitHub
[ACM MM-24] Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization
☆13Oct 8, 2024Updated last year
sejong-rcv / SejongRCV-Indoor
View on GitHub
NAVER LABS Mapping & Localization Challenge
☆11Jul 12, 2022Updated 4 years ago
mever-team / distill-and-select
View on GitHub
Authors official PyTorch implementation of the "DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval" [IJCV 20…
☆71Apr 12, 2023Updated 3 years ago
sejong-rcv / MLPD-Multi-Label-Pedestrian-Detection
View on GitHub
[RA-L with IROS2021] Multi-Label Pedestrian Detection in Multispectral data
☆60Mar 4, 2022Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
gayoungyeom / js-coding-test
View on GitHub
Javascript로 정리하는 [이것이 코딩 테스트다]
☆11Apr 11, 2023Updated 3 years ago
yyuncong / TempCLR
View on GitHub
[ICLR 2023] Temporal Alignment Representations with Contrastive Learning
☆27Apr 22, 2023Updated 3 years ago
saibr / hypvl
View on GitHub
This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https…
☆21Jul 5, 2024Updated 2 years ago
showlab / MovieSeq
View on GitHub
[ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences
☆46Mar 11, 2025Updated last year
MKLab-ITI / FIVR-200K
View on GitHub
FIVR-200K dataset from the "FIVR: Fine-grained Incident Video Retrieval" [TMM 2019]
☆81Apr 13, 2023Updated 3 years ago
snumprlab / isr-dpo
View on GitHub
Official Implementation of ISR-DPO:Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)
☆23Nov 25, 2025Updated 8 months ago
ytaek-oh / fsc-clip
View on GitHub
[EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
☆23Oct 8, 2024Updated last year
kwonjunn01 / Hi-Mapper
View on GitHub
☆19Nov 29, 2024Updated last year
facebookresearch / vsc2022
View on GitHub
Code for the Video Similarity Challenge.
☆87Feb 5, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
SAGNIKMJR / ego-AV-spatial-correspondence
View on GitHub
[CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'
☆14Jun 16, 2024Updated 2 years ago
jinhyunj / EaTR
View on GitHub
Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)
☆55Sep 7, 2023Updated 2 years ago
facebookresearch / ego-env
View on GitHub
Human-centric environment representations from egocentric video
☆15Feb 5, 2026Updated 5 months ago
neelnanda-io / neel-plotly
View on GitHub
A very hacky set of functions for getting plotly to do what I want when doing mech interp research, designed to be compatible with PyTorc…
☆15Jun 16, 2023Updated 3 years ago
xwen99 / temporal_context_aggregation
View on GitHub
(WACV 2021) Temporal Context Aggregation for Video Retrieval with Contrastive Learning
☆29Aug 4, 2021Updated 4 years ago
springkim / WSpring
View on GitHub
windows setup script
☆11Jan 22, 2023Updated 3 years ago
EasonXiao-888 / UVCOM
View on GitHub
[CVPR 2024] Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection
☆117Jul 17, 2024Updated 2 years ago
karchkha / MelSpec_GPT_VQVAE
View on GitHub
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Oct 8, 2023Updated 2 years ago
rlqja1107 / torch-ST-SGG
View on GitHub
Official PyTorch implementation Source code for Adaptive Self-Training Framework for Fine-grained Scene Graph generation (ST-SGG), accept…
☆22Jan 30, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
BolinLai / CSTS
View on GitHub
[ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation".
☆16Feb 24, 2025Updated last year
gyx-gloria / DMT
View on GitHub
Official Implementation of DMT: Dual Mean-Teacher in PyTorch.
☆10Oct 27, 2023Updated 2 years ago
alinlab / temporal-selfsupervision
View on GitHub
☆33Jul 28, 2022Updated 3 years ago
kanezaki / MIRO
View on GitHub
☆13Apr 16, 2018Updated 8 years ago
MRHiSum / MR.HiSum
View on GitHub
☆56Nov 1, 2024Updated last year
minghangz / SPL
View on GitHub
Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization
☆16Jul 20, 2023Updated 3 years ago
hunkim / o
View on GitHub
Toy O
☆16Sep 21, 2024Updated last year
kdariina / CLIP-not-BoW-unimodally
View on GitHub
Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"
☆29Feb 27, 2026Updated 4 months ago
alipay / VCSL
View on GitHub
Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR2022]
☆142Feb 4, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
iLearn-Lab / MM23-RTQ
View on GitHub
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model
☆15Apr 7, 2026Updated 3 months ago
Minglu58 / TA2V
View on GitHub
☆16Dec 1, 2025Updated 7 months ago
OpenGVLab / VKnowU
View on GitHub
[ECCV 2026] VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs
☆16Feb 3, 2026Updated 5 months ago
ExplainableML / EgoCVR
View on GitHub
[ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval
☆41Apr 11, 2025Updated last year
stogiannidis / srbench
View on GitHub
Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models"
☆19Feb 1, 2026Updated 5 months ago
tripletclip / TripletCLIP
View on GitHub
[NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"
☆48Dec 1, 2024Updated last year
MichWozPol / LEGO_StableDiffusion
View on GitHub
The project aim was to fine-tune the stable diffusion model in order to generate images in the LEGO style based on the prompt.
☆16Jun 7, 2023Updated 3 years ago