ufal / MLASK
EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"
☆10Updated last year
Alternatives and similar repositories for MLASK:
Users that are interested in MLASK are comparing it to the libraries listed below
- Code and dataset of "MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos" in MM'20.☆52Updated last year
- [CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos☆31Updated 9 months ago
- NAACL 2022 paper on Analyzing Modality Robustness in Multimodal Sentiment Analysis☆31Updated 2 years ago
- Multi-Scale Attention for Audio Question Answering☆28Updated last year
- Video Feature Extractor for S3D-HowTo100M☆29Updated 3 years ago
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.☆50Updated 2 years ago
- Implementation of the Benchmark Approaches for Medical Instructional Video Classification (MedVidCL) and Medical Video Question Answering…☆28Updated 2 years ago
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆85Updated 3 years ago
- Towards Long Form Audio-visual Video Understanding☆12Updated 3 months ago
- ACM MM '22: Unified Multi-modal Pre-training for Few-shot Sentiment Analysis with Prompt-based Learning☆15Updated 2 years ago
- ☆28Updated 2 years ago
- The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)☆75Updated last year
- Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''☆36Updated 3 years ago
- [ACM MM 2021 Oral] Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation"☆39Updated 3 years ago
- ☆48Updated 5 years ago
- The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".☆55Updated 3 years ago
- Code and dataset release for "PACS: A Dataset for Physical Audiovisual CommonSense Reasoning" (ECCV 2022)☆12Updated 2 years ago
- Source code of our MM'22 paper Partially Relevant Video Retrieval☆52Updated 2 months ago
- Multi-modal Multi-label Emotion Recognition with Heterogeneous Hierarchical Message Passing☆13Updated 2 years ago
- CLUE code☆12Updated 2 years ago
- ☆13Updated 7 months ago
- Recent Advances in Visual Dialog☆30Updated 2 years ago
- Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition☆30Updated 4 years ago
- Video Graph Transformer for Video Question Answering (ECCV'22)☆46Updated last year
- Humor Knowledge Enriched Transformer☆28Updated 3 years ago
- Group Gated Fusion on Attention-based Bidirectional Alignment for Multimodal Emotion Recognition☆13Updated 2 years ago
- The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).☆13Updated last year
- The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"☆18Updated last year
- ☆15Updated 2 months ago
- ☆23Updated last year