ppapalampidi / GraphTP
Source code for the AAAI 2021 paper "Movie Summarization via Sparse Graph Construction"
☆32 · Updated 4 years ago
Alternatives and similar repositories for GraphTP
Users who are interested in GraphTP compare it to the repositories listed below.
- Screenplay Summarization using Latent Narrative Structure ☆38 · Updated 3 years ago
- TuRnIng POint Dataset ☆48 · Updated 5 years ago
- The code and output of our AAAI paper "Knowledge-Enriched Visual Storytelling" ☆40 · Updated 4 years ago
- Github repository for Plot and Rework: Modeling Storylines for Visual Storytelling (ACL-IJCNLP2021 Findings) ☆21 · Updated 3 years ago
- Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20] ☆187 · Updated 3 years ago
- Official code and dataset link for "VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles" ☆36 · Updated 4 years ago
- Learning Interactions and Relationships between Movie Characters (CVPR'20) ☆21 · Updated 2 years ago
- ☆16 · Updated 4 years ago
- PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops) ☆145 · Updated 2 years ago
- Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC) ☆27 · Updated 3 years ago
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound" ☆144 · Updated 3 years ago
- ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data. ☆85 · Updated 2 years ago
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990) ☆61 · Updated 4 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning ☆290 · Updated 3 years ago
- Align and Prompt: Video-and-Language Pre-training with Entity Prompts ☆188 · Updated 4 months ago
- Documentation for the WenLan API, which was used to obtain image and text features. ☆39 · Updated 2 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs. ☆98 · Updated 2 years ago
- [ACL 2021] mTVR: Multilingual Video Moment Retrieval ☆27 · Updated 3 years ago
- Language Models Can See: Plugging Visual Controls in Text Generation ☆259 · Updated 3 years ago
- ☆33 · Updated 3 years ago
- Condensed Movies Challenge 2021 ☆19 · Updated 3 years ago
- MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions ☆169 · Updated last year
- The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization". ☆55 · Updated 3 years ago
- We ranked 1st in the DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD… ☆56 · Updated 2 years ago
- Using VideoBERT to tackle video prediction ☆132 · Updated 4 years ago
- [ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval ☆160 · Updated last year
- Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training" ☆235 · Updated 4 years ago
- ☆76 · Updated 2 years ago
- ☆53 · Updated 3 years ago
- Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference" ☆162 · Updated 5 years ago