WangWenhao0716 / VSC-DescriptorTrack-SubmissionLinks
[CVPR Challenge Rank 2nd] The codes and related files to reproduce the results for Video Similarity Challenge Descriptor Track.
☆19Updated 4 months ago
Alternatives and similar repositories for VSC-DescriptorTrack-Submission
Users that are interested in VSC-DescriptorTrack-Submission are comparing it to the libraries listed below
Sorting:
- TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision [AAAI2023 Oral]]☆55Updated 2 years ago
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆27Updated last year
- Precision Search through Multi-Style Inputs☆72Updated 2 weeks ago
- Code for the Video Similarity Challenge.☆81Updated last year
- Lion: Kindling Vision Intelligence within Large Language Models☆52Updated last year
- WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning☆34Updated 2 months ago
- This repository contains the dataset, codebase, and benchmarks for our paper: <CNVid-3.5M: Build, Filter, and Pre-train the Large-scale P…☆25Updated last year
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆13Updated 2 years ago
- Gradio demo used in our Osprey:Pixel Understanding with Visual Instruction Tuning.☆15Updated last year
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild☆31Updated last year
- Video dataset dedicated to portrait-mode video recognition.☆52Updated 8 months ago
- ☆53Updated last year
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Updated last year
- ☆87Updated last year
- Official implementation of TagAlign☆35Updated 8 months ago
- Masked Vision-Language Transformer in Fashion☆35Updated last year
- The official codes and datasets for Artistic Text Segmentation (ECCV 2024).☆25Updated 10 months ago
- ☆70Updated 2 years ago
- [ACM MM2025] The official repository for the RealSyn dataset☆36Updated last month
- Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR2022]☆127Updated last year
- A Dead Simple and Modularized Multi-Modal Training and Finetune Framework. Compatible to any LLaVA/Flamingo/QwenVL/MiniGemini etc series …☆19Updated last year
- ☆58Updated 2 years ago
- 2019 CCF 大数据与计算智能大赛 视频版权检测算法 复赛第8名方案 | 8th place solution of Video Copyright Detection Algorithm Track, 2019 CCF Big Data & Computing Int…☆30Updated 5 years ago
- [ECCV 2024 Oral] PetFace: A Large-Scale Dataset and Benchmark for Animal Identification https://arxiv.org/abs/2407.13555☆69Updated 2 weeks ago
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆96Updated 9 months ago
- [NeurIPS Challenge Rank 1st] The codes and related files to reproduce the results for Image Similarity Challenge Track 1.☆139Updated 3 years ago
- official code for "Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval"☆31Updated last month
- Official PyTorch implementation of the “Spatial-Semantic Collaborative Cropping for User Generated Content”. (AAAI24)☆66Updated last year
- [IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer☆104Updated last year
- Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"☆100Updated 2 years ago