ufal / MLASK
EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"
☆10Updated last year
Alternatives and similar repositories for MLASK:
Users that are interested in MLASK are comparing it to the libraries listed below
- [CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos☆31Updated 2 months ago
- Code and dataset of "MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos" in MM'20.☆53Updated last year
- [ACM MM 2021 Oral] Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation"☆39Updated 3 years ago
- The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".☆55Updated 3 years ago
- Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''☆36Updated 3 years ago
- ACM MM '22: Unified Multi-modal Pre-training for Few-shot Sentiment Analysis with Prompt-based Learning☆15Updated 2 years ago
- Humor Knowledge Enriched Transformer☆30Updated 3 years ago
- Implementation of the Benchmark Approaches for Medical Instructional Video Classification (MedVidCL) and Medical Video Question Answering…☆28Updated 2 years ago
- The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)☆75Updated last year
- The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).☆13Updated 2 years ago
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆86Updated 3 years ago
- NAACL 2022 paper on Analyzing Modality Robustness in Multimodal Sentiment Analysis☆31Updated 2 years ago
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.☆50Updated 3 years ago
- ☆16Updated 4 years ago
- ☆22Updated last year
- The code for ICASSP23 paper "MHSCNet: A Multimodal Hierarchical Shot-aware Convolutional Network for Video Summarization"☆10Updated 7 months ago
- ☆48Updated 6 years ago
- Multi-modal Multi-label Emotion Recognition with Heterogeneous Hierarchical Message Passing☆14Updated 2 years ago
- Video Feature Extractor for S3D-HowTo100M☆29Updated 3 years ago
- Code for paper, "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" ECCV 2022☆38Updated 2 years ago
- [ACL 2021] mTVR: Multilingual Video Moment Retrieval☆26Updated 2 years ago
- Video Graph Transformer for Video Question Answering (ECCV'22)☆47Updated last year
- ☆16Updated 4 months ago
- Text-Image Relationships (ACL 2019)☆21Updated last year
- Reproduce of 'Weakly Supervised Coupled Networks for Visual Sentiment Analysis'☆14Updated 5 years ago
- [EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction☆48Updated 2 years ago
- DeVLBert: Learning Deconfounded Visio-Linguistic Representations☆27Updated 2 years ago
- A Video-to-Text Framework☆10Updated last year
- Towards Long Form Audio-visual Video Understanding☆13Updated 5 months ago
- Pytorch implementation for Tailor Versatile Multi-modal Learning for Multi-label Emotion Recognition☆56Updated 2 years ago