line / Meta-AI-Video-Similarity-Challenge-3rd-Place-SolutionLinks
The 3rd Place Solution of the Meta AI Video Similarity Challenge : Descriptor Track and Matching Track.
☆23Updated 2 years ago
Alternatives and similar repositories for Meta-AI-Video-Similarity-Challenge-3rd-Place-Solution
Users that are interested in Meta-AI-Video-Similarity-Challenge-3rd-Place-Solution are comparing it to the libraries listed below
Sorting:
- [CVPR 2023 Workshop] The code reproduce the results of our solutions on both tracks for Meta AI Video Similarity Challenge (CVPR 2023 Wor…☆53Updated 2 years ago
- The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21) : Descriptor Track.☆141Updated last year
- Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR2022]☆129Updated last year
- TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision [AAAI2023 Oral]]☆56Updated 2 years ago
- (WACV 2021) Temporal Context Aggregation for Video Retrieval with Contrastive Learning☆29Updated 4 years ago
- Authors official PyTorch implementation of the "Self-Supervised Video Similarity Learning" [CVPRW 2023]☆41Updated last year
- Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"☆101Updated 2 years ago
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆19Updated 7 months ago
- Authors official PyTorch implementation of the "DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval" [IJCV 20…☆68Updated 2 years ago
- [NeurIPS Challenge Rank 1st] The codes and related files to reproduce the results for Image Similarity Challenge Track 1.☆140Updated 3 years ago
- Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).☆141Updated 3 years ago
- [IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer☆104Updated last year
- ☆253Updated 2 years ago
- ☆134Updated last year
- [CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features☆81Updated 10 months ago
- 2nd place solution to Google Universal Image Embedding Challenge!☆43Updated 2 years ago
- Research Code for Multimodal-Cognition Team in Ant Group☆167Updated 3 months ago
- Code for the Video Similarity Challenge.☆80Updated last year
- List of resources for video retrieval.☆21Updated 3 years ago
- An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".☆144Updated 6 months ago
- Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)☆197Updated last year
- The official code of "PLIP: Language-Image Pre-training for Person Representation Learning"☆122Updated 9 months ago
- An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"☆174Updated last year
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆43Updated 5 months ago
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features☆186Updated 2 years ago
- UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or …☆230Updated last year
- Lion: Kindling Vision Intelligence within Large Language Models☆51Updated last year
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆76Updated 2 months ago
- [NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset☆289Updated last year
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"☆106Updated last year