ppapalampidi / GraphTP
Source code for the AAAI 2021 paper "Movie Summarization via Sparse Graph Construction"
☆30Updated 3 years ago
Alternatives and similar repositories for GraphTP:
Users that are interested in GraphTP are comparing it to the libraries listed below
- TuRnIng POint Dataset☆46Updated 5 years ago
- Screenplay Summarization using Latent Narrative Structure☆36Updated 2 years ago
- Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''☆36Updated 3 years ago
- Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)☆27Updated 2 years ago
- The code and output of our AAAI paper "Knowledge-Enriched Visual Storytelling"☆40Updated 3 years ago
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)☆57Updated 3 years ago
- The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".☆55Updated 3 years ago
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"☆138Updated 2 years ago
- A curated list of research papers in Video Captioning☆118Updated 4 years ago
- Humor Knowledge Enriched Transformer☆28Updated 3 years ago
- Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]☆168Updated 2 years ago
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.☆50Updated 2 years ago
- [ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning☆169Updated 4 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 3 years ago
- VisualCOMET: Reasoning about the Dynamic Context of a Still Image☆85Updated last year
- Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"☆232Updated 3 years ago
- Align and Prompt: Video-and-Language Pre-training with Entity Prompts☆187Updated 2 years ago
- ☆53Updated 3 years ago
- Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners☆113Updated 2 years ago
- Learning Interactions and Relationships between Movie Characters (CVPR'20)☆21Updated last year
- [EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction☆48Updated 2 years ago
- [ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval☆155Updated 8 months ago
- Official Code of ICCV 2021 Paper: Learning to Cut by Watching Movies☆51Updated 2 years ago
- We rank the 1st in DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD…☆56Updated last year
- Re-implementation of the work Livebot☆15Updated 4 years ago
- Use CLIP to represent video for Retrieval Task☆69Updated 3 years ago
- PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)☆121Updated last year
- Github repository for Plot and Rework: Modeling Storylines for Visual Storytelling (ACL-IJCNLP2021 Findings)☆20Updated 2 years ago
- ☆28Updated 4 years ago
- Experiments with multimodal deep learning models based on transformers☆12Updated 2 years ago