ufal / MLASK
EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"
☆10Updated 10 months ago
Related projects: ⓘ
- NAACL 2022 paper on Analyzing Modality Robustness in Multimodal Sentiment Analysis☆31Updated last year
- ACM MM '22: Unified Multi-modal Pre-training for Few-shot Sentiment Analysis with Prompt-based Learning☆14Updated last year
- [ACM MM 2021 Oral] Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation"☆39Updated 3 years ago
- Code and dataset of "MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos" in MM'20.☆50Updated last year
- ☆20Updated last year
- Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition☆30Updated 3 years ago
- ☆18Updated 3 years ago
- Pytorch implementation for Tailor Versatile Multi-modal Learning for Multi-label Emotion Recognition☆53Updated last year
- ☆26Updated 2 years ago
- This repository contains the implementation of the paper -- Bi-Bimodal Modality Fusion for Correlation-Controlled Multimodal Sentiment An…☆64Updated last year
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆86Updated 3 years ago
- ☆19Updated 5 months ago
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.☆49Updated 2 years ago
- Code for Findings of ACL 2022 Paper "Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR Errors"☆25Updated 2 years ago
- Multi-modal Multi-label Emotion Recognition with Heterogeneous Hierarchical Message Passing☆14Updated last year
- Video Feature Extractor for S3D-HowTo100M☆28Updated 3 years ago
- ☆15Updated 3 years ago
- Multi-Scale Attention for Audio Question Answering☆24Updated last year
- ☆49Updated 5 years ago
- Group Gated Fusion on Attention-based Bidirectional Alignment for Multimodal Emotion Recognition☆13Updated 2 years ago
- Humor Knowledge Enriched Transformer☆28Updated 2 years ago
- CM-BERT: Cross-Modal BERT for Text-Audio Sentiment Analysis(MM2020)☆103Updated 3 years ago
- code for "Supervised Prototypical Contrastive Learning for Emotion Recognition in Conversation, EMNLP 22"☆73Updated last year
- [CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos☆30Updated 5 months ago
- 16k Hz Vocoder (HiFiGAN Codes and Pretrained Models)☆16Updated last year
- Reproduce of 'Weakly Supervised Coupled Networks for Visual Sentiment Analysis'☆14Updated 4 years ago
- Code and dataset release for "PACS: A Dataset for Physical Audiovisual CommonSense Reasoning" (ECCV 2022)☆10Updated last year
- MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022)☆73Updated 9 months ago
- MUSIC-AVQA, CVPR2022 (ORAL)☆66Updated last year
- source code for ICASSP 2022 paper: EmotionFlow: Capture the Dialogue Level Emotion Transitions☆26Updated 2 years ago