baiyang4 / D-LSG-Video-CaptionLinks
☆27Updated 3 years ago
Alternatives and similar repositories for D-LSG-Video-Caption
Users that are interested in D-LSG-Video-Caption are comparing it to the libraries listed below
Sorting:
- Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"☆54Updated 4 years ago
- source code of our RaNet in EMNLP 2021☆30Updated 3 years ago
- [CVPR2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.☆52Updated 2 years ago
- [AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding☆90Updated 2 years ago
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Updated 4 years ago
- PyTorch implementation of HANet: Hierarchical Alignment Networks for Video-Text Retrieval (ACM MM 2021).☆47Updated 3 years ago
- The code of IJCAI22 paper "GL-RG: Global-Local Representation Granularity for Video Captioning".☆19Updated 2 years ago
- Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.☆72Updated 2 years ago
- The PyTorch code of the AAAI2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".☆58Updated last year
- The codes and features of the re-implementation of SIGIR 2021 work "Deconfounded Video Moment Retrieval with Causal Intervention"☆34Updated 3 years ago
- Source code of our TCSVT'22 paper Reading-strategy Inspired Visual Representation Learning for Text-to-Video Retrieval☆19Updated 3 years ago
- Temporal Sentence Grounding in Videos / Natural Language Video Localization / Video Moment Retrieval的相关工作☆29Updated 3 years ago
- Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)☆123Updated 2 years ago
- ☆22Updated 3 years ago
- https://layer6ai-labs.github.io/xpool/☆125Updated 2 years ago
- "Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022☆69Updated 3 years ago
- Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)☆160Updated 2 years ago
- Weakly Supervised Video Moment Retrieval from Text Queries☆43Updated 4 years ago
- [2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval☆42Updated 3 years ago
- Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.☆118Updated 2 years ago
- ☆34Updated 3 years ago
- A Video-to-Text Framework☆10Updated last year
- Repository of proposal-free temporal moment localization work☆33Updated last year
- ☆20Updated 2 years ago
- Code for ACM MM2020 paper: Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization☆34Updated 4 years ago
- Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*☆31Updated 4 years ago
- Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)☆58Updated 3 years ago
- Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021☆66Updated 3 years ago
- Placeholder for code of BSP.☆11Updated 3 years ago
- [ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"☆66Updated 3 years ago