line / Meta-AI-Video-Similarity-Challenge-3rd-Place-Solution
The 3rd Place Solution of the Meta AI Video Similarity Challenge: Descriptor Track and Matching Track.
☆22Updated 2 years ago
Alternatives and similar repositories for Meta-AI-Video-Similarity-Challenge-3rd-Place-Solution
Users that are interested in Meta-AI-Video-Similarity-Challenge-3rd-Place-Solution are comparing it to the libraries listed below
- TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision [AAAI2023 Oral]☆55Updated 2 years ago
- The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21): Descriptor Track.☆141Updated last year
- Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR2022]☆127Updated last year
- [CVPR 2023 Workshop] Code to reproduce the results of our solutions on both tracks of the Meta AI Video Similarity Challenge (CVPR 2023 Wor…☆53Updated 2 years ago
- (WACV 2021) Temporal Context Aggregation for Video Retrieval with Contrastive Learning☆29Updated 4 years ago
- Authors' official PyTorch implementation of "Self-Supervised Video Similarity Learning" [CVPRW 2023]☆41Updated last year
- List of resources for video retrieval.☆20Updated 3 years ago
- [IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer☆104Updated last year
- Code for the Video Similarity Challenge.☆80Updated last year
- Authors' official PyTorch implementation of "DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval" [IJCV 20…☆68Updated 2 years ago
- Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"☆101Updated 2 years ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆83Updated 2 years ago
- [NeurIPS Challenge Rank 1st] The codes and related files to reproduce the results for Image Similarity Challenge Track 1.☆140Updated 3 years ago
- Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).☆140Updated 3 years ago
- [NeurIPS2021] BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting☆67Updated last year
- A computing solution based on deep learning that allows the efficient generation of keyshot type spotlights from videos.☆20Updated 3 years ago
- ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting☆86Updated 2 years ago
- An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".☆144Updated 6 months ago
- 2nd place solution to Google Universal Image Embedding Challenge!☆43Updated 2 years ago
- Towards Video Text Visual Question Answering: Benchmark and Baseline☆38Updated last year
- A new video text spotting framework with Transformer☆77Updated 3 years ago
- The official code of CornerTransformer (ECCV 2022, Oral) on top of MMOCR.☆145Updated 2 years ago
- [CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features☆81Updated 10 months ago
- Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)☆102Updated last year
- Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)☆196Updated last year
- Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)☆12Updated 3 years ago
- [NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset☆289Updated last year