ppapalampidi / GraphTP
Source code for the AAAI 2021 paper "Movie Summarization via Sparse Graph Construction"
☆31Updated 4 years ago
Alternatives and similar repositories for GraphTP:
Users that are interested in GraphTP are comparing it to the libraries listed below
- Screenplay Summarization using Latent Narrative Structure☆36Updated 2 years ago
- TuRnIng POint Dataset☆46Updated 5 years ago
- The code and output of our AAAI paper "Knowledge-Enriched Visual Storytelling"☆40Updated 3 years ago
- Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''☆36Updated 3 years ago
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)☆59Updated 3 years ago
- The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".☆55Updated 3 years ago
- Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)☆27Updated 2 years ago
- [EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction☆48Updated 2 years ago
- ☆53Updated 3 years ago
- VisualCOMET: Reasoning about the Dynamic Context of a Still Image☆85Updated last year
- Github repository for Plot and Rework: Modeling Storylines for Visual Storytelling (ACL-IJCNLP2021 Findings)☆21Updated 2 years ago
- PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers"☆50Updated 3 years ago
- Use CLIP to represent video for Retrieval Task☆69Updated 4 years ago
- ☆28Updated 5 years ago
- ☆33Updated 3 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆96Updated 2 years ago
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"☆139Updated 2 years ago
- [ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset☆90Updated last year
- Re-implementation of the work Livebot☆15Updated 4 years ago
- Humor Knowledge Enriched Transformer☆30Updated 3 years ago
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.☆51Updated 3 years ago
- Official Code of ICCV 2021 Paper: Learning to Cut by Watching Movies☆51Updated 2 years ago
- [ACL 2021] mTVR: Multilingual Video Moment Retrieval☆26Updated 2 years ago
- ☆44Updated 2 years ago
- multimodal video-audio-text generation and retrieval between every pair of modalities on the MUGEN dataset. The repo. contains the traini…☆40Updated 2 years ago
- Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners☆115Updated 2 years ago
- Official code repository for the EMNLP 2021 paper☆26Updated 3 years ago
- Visual Storytelling with Cross-Modal Rules☆7Updated 5 years ago
- 🥉 Codalab-Microsoft-COCO-Image-Captioning-Challenge 3rd place solution(06.30.21)☆23Updated 3 years ago
- [CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning☆90Updated last year