ppapalampidi / GraphTPLinks
Source code for the AAAI 2021 paper "Movie Summarization via Sparse Graph Construction"
☆30Updated 4 years ago
Alternatives and similar repositories for GraphTP
Users that are interested in GraphTP are comparing it to the libraries listed below
Sorting:
- TuRnIng POint Dataset☆47Updated 6 years ago
- Screenplay Summarization using Latent Narrative Structure☆38Updated 3 years ago
- Github repository for Plot and Rework: Modeling Storylines for Visual Storytelling (ACL-IJCNLP2021 Findings)☆21Updated 3 years ago
- The code and output of our AAAI paper "Knowledge-Enriched Visual Storytelling"☆40Updated 4 years ago
- Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''☆36Updated 4 years ago
- VisualCOMET: Reasoning about the Dynamic Context of a Still Image☆88Updated 2 years ago
- ☆53Updated 3 years ago
- Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)☆27Updated 3 years ago
- Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]☆188Updated 3 years ago
- ☆33Updated 3 years ago
- [ACL 2021] mTVR: Multilingual Video Moment Retrieval☆27Updated 3 years ago
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"☆146Updated 3 years ago
- Learning Interactions and Relationships between Movie Characters (CVPR'20)☆22Updated 2 years ago
- Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"☆162Updated 5 years ago
- Use CLIP to represent video for Retrieval Task☆70Updated 4 years ago
- A collection of models for image<->text generation in ACM MM 2021.☆67Updated 4 years ago
- ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.☆85Updated 2 years ago
- The Document of WenLan API, which was used to obtain image and text feature.☆41Updated 2 years ago
- ☆44Updated 5 months ago
- Official code repository for the EMNLP 2021 paper☆26Updated 3 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆98Updated 2 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Updated 3 years ago
- Language Models Can See: Plugging Visual Controls in Text Generation☆259Updated 3 years ago
- ☆29Updated 5 years ago
- Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"☆235Updated 4 years ago
- The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".☆55Updated 3 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Updated 4 years ago
- PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)☆144Updated 2 years ago
- [ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning☆171Updated 4 years ago
- PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)☆374Updated 2 years ago