WangWenhao0716 / VSC-DescriptorTrack-SubmissionLinks
[CVPR Challenge Rank 2nd] The codes and related files to reproduce the results for Video Similarity Challenge Descriptor Track.
☆19Updated 7 months ago
Alternatives and similar repositories for VSC-DescriptorTrack-Submission
Users that are interested in VSC-DescriptorTrack-Submission are comparing it to the libraries listed below
Sorting:
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆28Updated last year
- Precision Search through Multi-Style Inputs☆73Updated 3 months ago
- Lion: Kindling Vision Intelligence within Large Language Models☆51Updated last year
- ☆72Updated 2 years ago
- TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision [AAAI2023 Oral]]☆57Updated 2 years ago
- [ACM MM2025] The official repository for the RealSyn dataset☆38Updated 4 months ago
- Code for the Video Similarity Challenge.☆80Updated last year
- Chinese CLIP models with SOTA performance.☆59Updated 2 years ago
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆13Updated 2 years ago
- Video dataset dedicated to portrait-mode video recognition.☆54Updated last month
- Research Code for Multimodal-Cognition Team in Ant Group☆169Updated last month
- Official PyTorch implementation of the “Spatial-Semantic Collaborative Cropping for User Generated Content”. (AAAI24)☆70Updated last year
- Official implementation of TagAlign☆35Updated 11 months ago
- This repository contains the dataset, codebase, and benchmarks for our paper: <CNVid-3.5M: Build, Filter, and Pre-train the Large-scale P…☆25Updated 2 years ago
- Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"☆102Updated 2 years ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild☆32Updated last year
- LAVIS - A One-stop Library for Language-Vision Intelligence☆48Updated last year
- ☆19Updated last year
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆32Updated 5 months ago
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆52Updated last year
- ☆54Updated last year
- WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning☆35Updated 5 months ago
- ☆87Updated last year
- [ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model☆43Updated 11 months ago
- Facebook Image Similarity Challenge 2021☆19Updated 3 years ago
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Updated 2 years ago
- Toward Universal Multimodal Embedding☆67Updated 3 months ago
- ☆19Updated 2 years ago
- [CVPR 2023 Workshop] The code reproduce the results of our solutions on both tracks for Meta AI Video Similarity Challenge (CVPR 2023 Wor…☆54Updated 2 years ago
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"☆107Updated last year