line / Meta-AI-Video-Similarity-Challenge-3rd-Place-SolutionLinks
The 3rd Place Solution of the Meta AI Video Similarity Challenge : Descriptor Track and Matching Track.
☆22Updated 2 years ago
Alternatives and similar repositories for Meta-AI-Video-Similarity-Challenge-3rd-Place-Solution
Users that are interested in Meta-AI-Video-Similarity-Challenge-3rd-Place-Solution are comparing it to the libraries listed below
Sorting:
- The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21) : Descriptor Track.☆140Updated last year
- [CVPR 2023 Workshop] The code reproduce the results of our solutions on both tracks for Meta AI Video Similarity Challenge (CVPR 2023 Wor…☆54Updated 2 years ago
- TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision [AAAI2023 Oral]]☆55Updated 2 years ago
- Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR2022]☆128Updated last year
- (WACV 2021) Temporal Context Aggregation for Video Retrieval with Contrastive Learning☆29Updated 4 years ago
- Authors official PyTorch implementation of the "Self-Supervised Video Similarity Learning" [CVPRW 2023]☆42Updated last year
- Authors official PyTorch implementation of the "DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval" [IJCV 20…☆68Updated 2 years ago
- [NeurIPS Challenge Rank 1st] The codes and related files to reproduce the results for Image Similarity Challenge Track 1.☆140Updated 3 years ago
- Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"☆100Updated 2 years ago
- [IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer☆104Updated last year
- Code for the Video Similarity Challenge.☆80Updated last year
- Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)☆194Updated last year
- [CVPR Challenge Rank 2nd] The codes and related files to reproduce the results for Video Similarity Challenge Descriptor Track.☆19Updated 4 months ago
- ☆134Updated last year
- Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).☆140Updated 3 years ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆83Updated last year
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features☆181Updated last year
- An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".☆144Updated 5 months ago
- ☆251Updated 2 years ago
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆19Updated 6 months ago
- [ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective☆190Updated last year
- ☆52Updated 2 years ago
- Towards Video Text Visual Question Answering: Benchmark and Baseline☆38Updated last year
- [NeurIPS2021] BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting☆67Updated last year
- ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting☆86Updated 2 years ago
- List of resources for video retrieval.☆19Updated 3 years ago
- Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)☆12Updated 3 years ago
- ☆29Updated 3 years ago
- mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)☆229Updated 2 years ago
- ☆117Updated last year