starmemda / CAMoE
β96Updated 3 years ago
Related projects β
Alternatives and complementary repositories for CAMoE
- πKaleido-BERT: Vision-Language Pre-training on Fashion Domainβ263Updated 2 years ago
- Starter Code for VALUE benchmarkβ79Updated 2 years ago
- [arXiv22] Disentangled Representation Learning for Text-Video Retrievalβ91Updated 2 years ago
- Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021β64Updated 3 years ago
- Code and benchmarks for the Semantic Video Retrieval Taskβ54Updated 2 years ago
- Learning Spatiotemporal Features via Video and Text Pair Discriminationβ59Updated 3 years ago
- The HC-STVG Datasetβ53Updated last year
- The codes and features of the re-implementation of SIGIR 2021 work "Deconfounded Video Moment Retrieval with Causal Intervention"β35Updated 3 years ago
- [SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval. Also, a text-video retrieval toolbox based on CLIP + fast pβ¦β125Updated 2 years ago
- An optimized re-implementation for 2D-TAN: Learning 2D Temporal Localization Networks for Moment Localization with Natural Language (AAAIβ¦β125Updated last year
- Cross Modal Retrieval with Querybank Normalisationβ53Updated 11 months ago
- Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"β97Updated 3 years ago
- Code for ACM MM2020 paper: Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localizationβ33Updated 4 years ago
- Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware Pβ¦β60Updated last year
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioningβ79Updated 3 years ago
- [2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrievalβ33Updated 3 years ago
- Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)β51Updated 3 years ago
- Weakly Supervised Video Moment Retrieval from Text Queriesβ42Updated 4 years ago
- Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding"β130Updated 3 years ago
- [ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrievalβ152Updated 5 months ago
- Dense Regression Network for Video Grounding (CVPR2020)β50Updated 3 years ago
- https://layer6ai-labs.github.io/xpool/β114Updated last year
- Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"β52Updated 3 years ago
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions"β42Updated 2 years ago
- source code of our RaNet in EMNLP 2021β30Updated 2 years ago
- β231Updated last year
- A PyTorch implementation of VIOLETβ137Updated 10 months ago
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020β82Updated 3 years ago
- [CVPR 2021] Multi-shot Temporal Event Localization: a Benchmarkβ55Updated 2 years ago
- Code for CVPR 2021 paper: Context-aware Biaffine Localizing Network for Temporal Sentence Groundingβ20Updated 3 years ago