starmemda / CAMoE
☆99Updated 3 years ago
Alternatives and similar repositories for CAMoE:
Users that are interested in CAMoE are comparing it to the libraries listed below
- 💐Kaleido-BERT: Vision-Language Pre-training on Fashion Domain☆265Updated 2 years ago
- Multi-Scale Aligned Distillation for Low-Resolution Detection (CVPR2021)☆128Updated 3 years ago
- Starter Code for VALUE benchmark☆80Updated 2 years ago
- [SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval. Also, a text-video retrieval toolbox based on CLIP + fast p…☆130Updated 2 years ago
- Cross Modal Retrieval with Querybank Normalisation☆55Updated last year
- [CVPR 2022] The code for our paper 《Object-aware Video-language Pre-training for Retrieval》☆62Updated 2 years ago
- Graph Contrastive Clustering (ICCV2021)☆90Updated 2 years ago
- Official PyTorch implementation of the “Context Decoupling Augmentation for Weakly Supervised Semantic Segmentation” (ICCV 2021)☆57Updated 3 years ago
- Code and benchmarks for the Semantic Video Retrieval Task☆53Updated 2 years ago
- A PyTorch implementation of VIOLET☆137Updated last year
- Learning Spatiotemporal Features via Video and Text Pair Discrimination☆59Updated 4 years ago
- [arXiv22] Disentangled Representation Learning for Text-Video Retrieval☆96Updated 2 years ago
- Use CLIP to represent video for Retrieval Task☆69Updated 4 years ago
- ☆26Updated last year
- Adversarial Attack and Defense in Deep Ranking, T-PAMI, 2024☆23Updated last year
- Source code of our MM'22 paper Partially Relevant Video Retrieval☆53Updated 4 months ago
- Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding"☆22Updated 4 years ago
- Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)☆57Updated 3 years ago
- [ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization☆39Updated 2 years ago
- Temporal Moment(Action) Localization via Language / Temporal Language Grounding / Video Moment Retrieval☆96Updated 3 years ago
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions"☆42Updated 2 years ago
- include removing batch normalize layer, calculate FLOPS and Parameters.☆18Updated 6 years ago
- [2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval☆39Updated 3 years ago
- https://layer6ai-labs.github.io/xpool/☆122Updated last year
- Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos☆86Updated 4 years ago
- Align and Prompt: Video-and-Language Pre-training with Entity Prompts☆186Updated 2 years ago
- Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).☆47Updated 2 years ago
- An optimized re-implementation for 2D-TAN: Learning 2D Temporal Localization Networks for Moment Localization with Natural Language (AAAI…☆126Updated 2 years ago
- The HC-STVG Dataset☆55Updated last year
- Weakly Supervised Video Moment Retrieval from Text Queries☆43Updated 4 years ago