ppapalampidi / GraphTPLinks
Source code for the AAAI 2021 paper "Movie Summarization via Sparse Graph Construction"
☆31Updated 4 years ago
Alternatives and similar repositories for GraphTP
Users that are interested in GraphTP are comparing it to the libraries listed below
Sorting:
- Screenplay Summarization using Latent Narrative Structure☆38Updated 3 years ago
- TuRnIng POint Dataset☆46Updated 5 years ago
- The code and output of our AAAI paper "Knowledge-Enriched Visual Storytelling"☆40Updated 4 years ago
- The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".☆55Updated 3 years ago
- Github repository for Plot and Rework: Modeling Storylines for Visual Storytelling (ACL-IJCNLP2021 Findings)☆21Updated 2 years ago
- ☆53Updated 3 years ago
- Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)☆27Updated 2 years ago
- Learning Interactions and Relationships between Movie Characters (CVPR'20)☆21Updated 2 years ago
- VisualCOMET: Reasoning about the Dynamic Context of a Still Image☆87Updated 2 years ago
- The Document of WenLan API, which was used to obtain image and text feature.☆39Updated 2 years ago
- A collection of models for image<->text generation in ACM MM 2021.☆66Updated 3 years ago
- Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]☆182Updated 2 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆289Updated 2 years ago
- Humor Knowledge Enriched Transformer☆30Updated 3 years ago
- ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.☆85Updated 2 years ago
- PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)☆372Updated 2 years ago
- Language Models Can See: Plugging Visual Controls in Text Generation☆258Updated 3 years ago
- Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''☆36Updated 4 years ago
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"☆143Updated 3 years ago
- Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"☆162Updated 5 years ago
- Pytorch implementation for "Progressive Video Summarization via Multimodal Self-supervised Learning"☆35Updated 2 years ago
- PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)☆243Updated 2 months ago
- Align and Prompt: Video-and-Language Pre-training with Entity Prompts☆188Updated 3 months ago
- ☆45Updated last month
- MERLOT: Multimodal Neural Script Knowledge Models☆224Updated 3 years ago
- PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers"☆50Updated 3 years ago
- Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"☆234Updated 3 years ago
- PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)☆125Updated 2 years ago
- An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"☆359Updated last year
- ☆158Updated 3 years ago