WangWenhao0716 / VSC-DescriptorTrack-SubmissionLinks
[CVPR Challenge Rank 2nd] The codes and related files to reproduce the results for Video Similarity Challenge Descriptor Track.
☆19Updated 3 months ago
Alternatives and similar repositories for VSC-DescriptorTrack-Submission
Users that are interested in VSC-DescriptorTrack-Submission are comparing it to the libraries listed below
Sorting:
- Code for the Video Similarity Challenge.☆81Updated last year
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆26Updated last year
- TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision [AAAI2023 Oral]]☆54Updated 2 years ago
- Facebook Image Similarity Challenge 2021☆19Updated 3 years ago
- ☆22Updated 3 years ago
- [CVPR 2023 Workshop] The code reproduce the results of our solutions on both tracks for Meta AI Video Similarity Challenge (CVPR 2023 Wor…☆53Updated 2 years ago
- Precision Search through Multi-Style Inputs☆71Updated 2 months ago
- ☆11Updated 8 months ago
- [CVPR 2022 Challenge Rank 1st] The official code for V2L: Leveraging Vision and Vision-language Models into Large-scale Product Retrieval…☆29Updated 2 years ago
- Video dataset dedicated to portrait-mode video recognition.☆52Updated 7 months ago
- ☆29Updated 3 years ago
- 2019 CCF 大数据与计算智能大赛 视频版权检测算法 复赛第8名方案 | 8th place solution of Video Copyright Detection Algorithm Track, 2019 CCF Big Data & Computing Int…☆30Updated 5 years ago
- Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR2022]☆127Updated last year
- LMM solved catastrophic forgetting, AAAI2025☆44Updated 3 months ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild☆30Updated last year
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆37Updated last year
- LAEO-Net++☆20Updated 4 years ago
- CVPR’2022 Kinetics-GEBD Challenge☆10Updated 3 years ago
- The top conferences on video retrieval libraries in recent years, synchronized with my blog.☆14Updated 3 years ago
- ☆69Updated 2 years ago
- ☆28Updated 3 years ago
- ☆87Updated last year
- Chinese CLIP models with SOTA performance.☆55Updated last year
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Updated last year
- Frame Flexible Network (CVPR2023)☆56Updated 2 years ago
- WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning☆28Updated last month
- official code for "Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval"☆24Updated last week
- ☆19Updated 2 years ago
- Masked Vision-Language Transformer in Fashion☆33Updated last year
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.☆26Updated 2 years ago