ppapalampidi / GraphTPLinks
Source code for the AAAI 2021 paper "Movie Summarization via Sparse Graph Construction"
☆30Updated 4 years ago
Alternatives and similar repositories for GraphTP
Users that are interested in GraphTP are comparing it to the libraries listed below
Sorting:
- Screenplay Summarization using Latent Narrative Structure☆38Updated 3 years ago
- TuRnIng POint Dataset☆47Updated 6 years ago
- Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)☆27Updated 3 years ago
- The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".☆55Updated 3 years ago
- The code and output of our AAAI paper "Knowledge-Enriched Visual Storytelling"☆40Updated 4 years ago
- VisualCOMET: Reasoning about the Dynamic Context of a Still Image☆88Updated 2 years ago
- ☆53Updated 4 years ago
- Github repository for Plot and Rework: Modeling Storylines for Visual Storytelling (ACL-IJCNLP2021 Findings)☆21Updated 3 years ago
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"☆146Updated 3 years ago
- Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]☆189Updated 3 years ago
- Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''☆36Updated 4 years ago
- Official Github Repo for the Findings of EMNLP 2021 paper "An animated picture says at least a thousand words: Selecting Gif-based Replie…☆32Updated 4 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Updated 3 years ago
- Humor Knowledge Enriched Transformer☆31Updated 4 years ago
- ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.☆86Updated 2 years ago
- Learning Interactions and Relationships between Movie Characters (CVPR'20)☆22Updated 2 years ago
- Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"☆162Updated 5 years ago
- A collection of models for image<->text generation in ACM MM 2021.☆67Updated 4 years ago
- Language Models Can See: Plugging Visual Controls in Text Generation☆259Updated 3 years ago
- PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)☆374Updated 2 years ago
- ☆29Updated 5 years ago
- PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers"☆50Updated 4 years ago
- MERLOT: Multimodal Neural Script Knowledge Models☆225Updated 3 years ago
- ☆44Updated 6 months ago
- We rank the 1st in DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD…☆56Updated 2 years ago
- Use CLIP to represent video for Retrieval Task☆70Updated 4 years ago
- Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"☆235Updated 4 years ago
- [ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning☆171Updated 5 years ago
- Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners☆115Updated 3 years ago
- Using VideoBERT to tackle video prediction☆133Updated 4 years ago