Annusha / LIReC
Learning Interactions and Relationships between Movie Characters (CVPR'20)
☆21 Updated 2 years ago
Alternatives and similar repositories for LIReC
Users that are interested in LIReC are comparing it to the libraries listed below
- Source code of our TCSVT'22 paper Reading-strategy Inspired Visual Representation Learning for Text-to-Video Retrieval ☆19 Updated 3 years ago
- Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021 ☆66 Updated 3 years ago
- Code and dataset of "MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos" in MM'20. ☆54 Updated 2 years ago
- Repository of proposal-free temporal moment localization work ☆33 Updated last year
- [ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval ☆160 Updated last year
- [CVPR 2022] A large-scale public benchmark dataset for video question-answering, especially about evidence and commonsense reasoning. The… ☆72 Updated 2 months ago
- PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops) ☆144 Updated 2 years ago
- Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral) ☆133 Updated last year
- Align and Prompt: Video-and-Language Pre-training with Entity Prompts ☆188 Updated 4 months ago
- Video Graph Transformer for Video Question Answering (ECCV'22) ☆48 Updated 2 years ago
- ☆15 Updated last year
- [EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction ☆51 Updated 3 years ago
- Weakly Supervised Video Moment Retrieval from Text Queries ☆43 Updated 5 years ago
- Code for ACM MM2020 paper: Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization ☆34 Updated 5 years ago
- Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training" ☆235 Updated 4 years ago
- Source code of our MM'22 paper Partially Relevant Video Retrieval ☆54 Updated 10 months ago
- [arXiv22] Disentangled Representation Learning for Text-Video Retrieval ☆96 Updated 3 years ago
- Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning". ☆212 Updated 5 years ago
- ☆16 Updated 4 years ago
- source code of our RaNet in EMNLP 2021 ☆30 Updated 3 years ago
- Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos ☆87 Updated 4 years ago
- ACM MULTIMEDIA CONFERENCE 2020 ☆11 Updated 5 years ago
- Starter Code for VALUE benchmark ☆80 Updated 3 years ago
- ☆27 Updated 3 years ago
- The Document of WenLan API, which was used to obtain image and text feature. ☆39 Updated 2 years ago
- Video Feature Extractor for S3D-HowTo100M ☆29 Updated 4 years ago
- A PyTorch reimplementation of bottom-up-attention models ☆16 Updated 4 years ago
- ☆251 Updated 2 years ago
- Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding* ☆31 Updated 4 years ago
- Span-based Localizing Network for Natural Language Video Localization (ACL 2020) ☆109 Updated 3 years ago